Overview

Dataset statistics

Number of variables21
Number of observations45476
Missing cells26218
Missing cells (%)2.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.3 MiB
Average record size in memory168.0 B

Variable types

Numeric10
Categorical11

Alerts

Genres has a high cardinality: 4068 distinct valuesHigh cardinality
OriginalLanguage has a high cardinality: 93 distinct valuesHigh cardinality
Overview has a high cardinality: 44234 distinct valuesHigh cardinality
ProductionCompanies has a high cardinality: 22667 distinct valuesHigh cardinality
ProductionCountries has a high cardinality: 2390 distinct valuesHigh cardinality
ReleaseDate has a high cardinality: 17334 distinct valuesHigh cardinality
Tagline has a high cardinality: 20269 distinct valuesHigh cardinality
Title has a high cardinality: 42197 distinct valuesHigh cardinality
Director has a high cardinality: 17573 distinct valuesHigh cardinality
MovieCharacter has a high cardinality: 40180 distinct valuesHigh cardinality
ActorName has a high cardinality: 42678 distinct valuesHigh cardinality
Budget is highly overall correlated with Revenue and 1 other fieldsHigh correlation
Popularity is highly overall correlated with VoteCountHigh correlation
Revenue is highly overall correlated with Budget and 2 other fieldsHigh correlation
VoteCount is highly overall correlated with Popularity and 1 other fieldsHigh correlation
Return is highly overall correlated with Budget and 1 other fieldsHigh correlation
OriginalLanguage is highly imbalanced (67.4%)Imbalance
ProductionCountries is highly imbalanced (58.3%)Imbalance
Tagline has 25078 (55.1%) missing valuesMissing
Popularity is highly skewed (γ1 = 29.21506573)Skewed
Return is highly skewed (γ1 = 138.3340992)Skewed
Tagline is uniformly distributedUniform
Title is uniformly distributedUniform
Budget has 36490 (80.2%) zerosZeros
Revenue has 37972 (83.5%) zerosZeros
Runtime has 1535 (3.4%) zerosZeros
VoteAverage has 2947 (6.5%) zerosZeros
VoteCount has 2849 (6.3%) zerosZeros
Return has 39998 (88.0%) zerosZeros

Reproduction

Analysis started2023-06-13 15:11:30.412539
Analysis finished2023-06-13 15:12:16.736040
Duration46.32 seconds
Software versionpandas-profiling v3.6.6
Download configurationconfig.json

Variables

Budget
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1223
Distinct (%)2.7%
Missing100
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean4232604.4
Minimum0
Maximum3.8 × 108
Zeros36490
Zeros (%)80.2%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:17.232623image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile25000000
Maximum3.8 × 108
Range3.8 × 108
Interquartile range (IQR)0

Descriptive statistics

Standard deviation17439860
Coefficient of variation (CV)4.1203614
Kurtosis66.634491
Mean4232604.4
Median Absolute Deviation (MAD)0
Skewness7.1183385
Sum1.9205866 × 1011
Variance3.041487 × 1014
MonotonicityNot monotonic
2023-06-13T12:12:17.642622image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 36490
80.2%
5000000 286
 
0.6%
10000000 259
 
0.6%
20000000 243
 
0.5%
2000000 242
 
0.5%
15000000 226
 
0.5%
3000000 223
 
0.5%
25000000 206
 
0.5%
1000000 197
 
0.4%
30000000 190
 
0.4%
Other values (1213) 6814
 
15.0%
ValueCountFrequency (%)
0 36490
80.2%
1 25
 
0.1%
2 14
 
< 0.1%
3 9
 
< 0.1%
4 8
 
< 0.1%
5 8
 
< 0.1%
6 5
 
< 0.1%
7 4
 
< 0.1%
8 5
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
380000000 1
 
< 0.1%
300000000 1
 
< 0.1%
280000000 1
 
< 0.1%
270000000 1
 
< 0.1%
260000000 3
 
< 0.1%
258000000 1
 
< 0.1%
255000000 1
 
< 0.1%
250000000 10
< 0.1%
245000000 2
 
< 0.1%
237000000 1
 
< 0.1%

Genres
Categorical

Distinct4068
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
Drama
4998 
Comedy
3621 
Documentary
 
2713
NoGenre
 
2481
Drama, Romance
 
1301
Other values (4063)
30362 

Length

Max length84
Median length68
Mean length15.950567
Min length3

Characters and Unicode

Total characters725368
Distinct characters41
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2367 ?
Unique (%)5.2%

Sample

1st rowAnimation, Comedy, Family
2nd rowAdventure, Fantasy, Family
3rd rowRomance, Comedy
4th rowComedy, Drama, Romance
5th rowComedy

Common Values

ValueCountFrequency (%)
Drama 4998
 
11.0%
Comedy 3621
 
8.0%
Documentary 2713
 
6.0%
NoGenre 2481
 
5.5%
Drama, Romance 1301
 
2.9%
Comedy, Drama 1135
 
2.5%
Horror 974
 
2.1%
Comedy, Romance 930
 
2.0%
Comedy, Drama, Romance 593
 
1.3%
Drama, Comedy 532
 
1.2%
Other values (4058) 26198
57.6%

Length

2023-06-13T12:12:17.982246image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
drama 20255
20.8%
comedy 13181
13.5%
thriller 7619
 
7.8%
romance 6733
 
6.9%
action 6592
 
6.8%
horror 4670
 
4.8%
crime 4305
 
4.4%
documentary 3921
 
4.0%
adventure 3494
 
3.6%
science 3042
 
3.1%
Other values (37) 23540
24.2%

Most occurring characters

ValueCountFrequency (%)
r 71563
 
9.9%
a 61822
 
8.5%
e 60748
 
8.4%
m 53101
 
7.3%
51876
 
7.2%
o 51022
 
7.0%
, 48053
 
6.6%
i 39670
 
5.5%
n 38157
 
5.3%
y 28510
 
3.9%
Other values (31) 220846
30.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 524833
72.4%
Uppercase Letter 100606
 
13.9%
Space Separator 51876
 
7.2%
Other Punctuation 48053
 
6.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 71563
13.6%
a 61822
11.8%
e 60748
11.6%
m 53101
10.1%
o 51022
9.7%
i 39670
7.6%
n 38157
7.3%
y 28510
 
5.4%
c 27977
 
5.3%
t 26210
 
5.0%
Other values (12) 66053
12.6%
Uppercase Letter
ValueCountFrequency (%)
D 24176
24.0%
C 17489
17.4%
A 12020
11.9%
F 9746
9.7%
T 8389
 
8.3%
R 6735
 
6.7%
H 6068
 
6.0%
M 4830
 
4.8%
S 3046
 
3.0%
G 2483
 
2.5%
Other values (7) 5624
 
5.6%
Space Separator
ValueCountFrequency (%)
51876
100.0%
Other Punctuation
ValueCountFrequency (%)
, 48053
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 625439
86.2%
Common 99929
 
13.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 71563
11.4%
a 61822
 
9.9%
e 60748
 
9.7%
m 53101
 
8.5%
o 51022
 
8.2%
i 39670
 
6.3%
n 38157
 
6.1%
y 28510
 
4.6%
c 27977
 
4.5%
t 26210
 
4.2%
Other values (29) 166659
26.6%
Common
ValueCountFrequency (%)
51876
51.9%
, 48053
48.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 725368
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 71563
 
9.9%
a 61822
 
8.5%
e 60748
 
8.4%
m 53101
 
7.3%
51876
 
7.2%
o 51022
 
7.0%
, 48053
 
6.6%
i 39670
 
5.5%
n 38157
 
5.3%
y 28510
 
3.9%
Other values (31) 220846
30.4%

OriginalLanguage
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct93
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
en
32202 
fr
 
2437
it
 
1528
ja
 
1349
de
 
1078
Other values (88)
6882 

Length

Max length10
Median length2
Mean length2.019153
Min length2

Characters and Unicode

Total characters91823
Distinct characters35
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)< 0.1%

Sample

1st rowen
2nd rowen
3rd rowen
4th rowen
5th rowen

Common Values

ValueCountFrequency (%)
en 32202
70.8%
fr 2437
 
5.4%
it 1528
 
3.4%
ja 1349
 
3.0%
de 1078
 
2.4%
es 992
 
2.2%
ru 822
 
1.8%
hi 508
 
1.1%
ko 444
 
1.0%
zh 408
 
0.9%
Other values (83) 3708
 
8.2%

Length

2023-06-13T12:12:18.246207image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
en 32202
70.8%
fr 2437
 
5.4%
it 1528
 
3.4%
ja 1349
 
3.0%
de 1078
 
2.4%
es 992
 
2.2%
ru 822
 
1.8%
hi 508
 
1.1%
ko 444
 
1.0%
zh 408
 
0.9%
Other values (83) 3708
 
8.2%

Most occurring characters

ValueCountFrequency (%)
e 34635
37.7%
n 33018
36.0%
r 3630
 
4.0%
f 2835
 
3.1%
i 2388
 
2.6%
t 2250
 
2.5%
a 2055
 
2.2%
s 1652
 
1.8%
j 1350
 
1.5%
d 1323
 
1.4%
Other values (25) 6687
 
7.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 91594
99.8%
Uppercase Letter 216
 
0.2%
Decimal Number 10
 
< 0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 34635
37.8%
n 33018
36.0%
r 3630
 
4.0%
f 2835
 
3.1%
i 2388
 
2.6%
t 2250
 
2.5%
a 2055
 
2.2%
s 1652
 
1.8%
j 1350
 
1.5%
d 1323
 
1.4%
Other values (16) 6458
 
7.1%
Decimal Number
ValueCountFrequency (%)
0 4
40.0%
8 2
20.0%
2 1
 
10.0%
6 1
 
10.0%
1 1
 
10.0%
4 1
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
N 108
50.0%
L 108
50.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 91810
> 99.9%
Common 13
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 34635
37.7%
n 33018
36.0%
r 3630
 
4.0%
f 2835
 
3.1%
i 2388
 
2.6%
t 2250
 
2.5%
a 2055
 
2.2%
s 1652
 
1.8%
j 1350
 
1.5%
d 1323
 
1.4%
Other values (18) 6674
 
7.3%
Common
ValueCountFrequency (%)
0 4
30.8%
. 3
23.1%
8 2
15.4%
2 1
 
7.7%
6 1
 
7.7%
1 1
 
7.7%
4 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 91823
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 34635
37.7%
n 33018
36.0%
r 3630
 
4.0%
f 2835
 
3.1%
i 2388
 
2.6%
t 2250
 
2.5%
a 2055
 
2.2%
s 1652
 
1.8%
j 1350
 
1.5%
d 1323
 
1.4%
Other values (25) 6687
 
7.3%

Overview
Categorical

Distinct44234
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
NoOverview
 
1038
No overview found.
 
133
No Overview
 
7
 
5
Released
 
3
Other values (44229)
44290 

Length

Max length1000
Median length791
Mean length316.12519
Min length1

Characters and Unicode

Total characters14376109
Distinct characters429
Distinct categories25 ?
Distinct scripts13 ?
Distinct blocks21 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44173 ?
Unique (%)97.1%

Sample

1st rowLed by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences.
2nd rowWhen siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures.
3rd rowA family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max.
4th rowCheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe.
5th rowJust when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own.

Common Values

ValueCountFrequency (%)
NoOverview 1038
 
2.3%
No overview found. 133
 
0.3%
No Overview 7
 
< 0.1%
5
 
< 0.1%
Released 3
 
< 0.1%
Recovering from a nail gun shot to the head and 13 months of coma, doctor Pekka Valinta starts to unravel the mystery of his past, still suffering from total amnesia. 3
 
< 0.1%
King Lear, old and tired, divides his kingdom among his daughters, giving great importance to their protestations of love for him. When Cordelia, youngest and most honest, refuses to idly flatter the old man in return for favor, he banishes her and turns for support to his remaining daughters. But Goneril and Regan have no love for him and instead plot to take all his power from him. In a parallel, Lear's loyal courtier Gloucester favors his illegitimate son Edmund after being told lies about his faithful son Edgar. Madness and tragedy befall both ill-starred fathers. 3
 
< 0.1%
No movie overview available. 3
 
< 0.1%
Adaptation of the Jane Austen novel. 3
 
< 0.1%
A few funny little novels about different aspects of life. 3
 
< 0.1%
Other values (44224) 44275
97.4%

Length

2023-06-13T12:12:18.515115image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
the 138082
 
5.6%
a 98889
 
4.0%
and 75259
 
3.1%
to 73321
 
3.0%
of 69574
 
2.8%
in 48143
 
2.0%
is 36500
 
1.5%
his 36165
 
1.5%
with 23902
 
1.0%
her 21484
 
0.9%
Other values (97092) 1828430
74.6%

Most occurring characters

ValueCountFrequency (%)
2406350
16.7%
e 1365872
 
9.5%
a 940505
 
6.5%
t 934766
 
6.5%
i 852552
 
5.9%
o 830911
 
5.8%
n 822601
 
5.7%
s 767854
 
5.3%
r 745312
 
5.2%
h 600810
 
4.2%
Other values (419) 4108576
28.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 11158386
77.6%
Space Separator 2406388
 
16.7%
Uppercase Letter 393041
 
2.7%
Other Punctuation 312824
 
2.2%
Decimal Number 42223
 
0.3%
Dash Punctuation 36767
 
0.3%
Close Punctuation 10100
 
0.1%
Open Punctuation 10077
 
0.1%
Final Punctuation 4556
 
< 0.1%
Initial Punctuation 882
 
< 0.1%
Other values (15) 865
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1365872
12.2%
a 940505
 
8.4%
t 934766
 
8.4%
i 852552
 
7.6%
o 830911
 
7.4%
n 822601
 
7.4%
s 767854
 
6.9%
r 745312
 
6.7%
h 600810
 
5.4%
l 478816
 
4.3%
Other values (142) 2818387
25.3%
Uppercase Letter
ValueCountFrequency (%)
A 42751
 
10.9%
T 35968
 
9.2%
S 31126
 
7.9%
M 23954
 
6.1%
B 23699
 
6.0%
C 22803
 
5.8%
H 19429
 
4.9%
W 18652
 
4.7%
I 16798
 
4.3%
D 16311
 
4.1%
Other values (77) 141550
36.0%
Other Letter
ValueCountFrequency (%)
6
 
4.8%
6
 
4.8%
5
 
4.0%
4
 
3.2%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
م 2
 
1.6%
Other values (76) 88
70.4%
Other Punctuation
ValueCountFrequency (%)
, 133443
42.7%
. 124794
39.9%
' 31121
 
9.9%
" 11661
 
3.7%
: 3299
 
1.1%
? 2759
 
0.9%
; 2493
 
0.8%
! 1543
 
0.5%
/ 765
 
0.2%
& 453
 
0.1%
Other values (12) 493
 
0.2%
Nonspacing Mark
ValueCountFrequency (%)
́ 4
12.1%
ి 4
12.1%
3
9.1%
3
9.1%
3
9.1%
̈ 3
9.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
Other values (4) 5
15.2%
Decimal Number
ValueCountFrequency (%)
1 9748
23.1%
0 8265
19.6%
9 6405
15.2%
2 4251
10.1%
5 2440
 
5.8%
8 2379
 
5.6%
3 2342
 
5.5%
4 2176
 
5.2%
7 2131
 
5.0%
6 2086
 
4.9%
Spacing Mark
ValueCountFrequency (%)
11
40.7%
4
 
14.8%
3
 
11.1%
3
 
11.1%
ि 2
 
7.4%
2
 
7.4%
1
 
3.7%
ி 1
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 35244
95.9%
881
 
2.4%
633
 
1.7%
5
 
< 0.1%
4
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
® 45
70.3%
14
 
21.9%
¦ 2
 
3.1%
° 2
 
3.1%
1
 
1.6%
Math Symbol
ValueCountFrequency (%)
~ 20
50.0%
+ 11
27.5%
= 6
 
15.0%
| 2
 
5.0%
1
 
2.5%
Open Punctuation
ValueCountFrequency (%)
( 10024
99.5%
[ 50
 
0.5%
{ 2
 
< 0.1%
1
 
< 0.1%
Currency Symbol
ValueCountFrequency (%)
$ 317
96.4%
£ 10
 
3.0%
1
 
0.3%
1
 
0.3%
Space Separator
ValueCountFrequency (%)
2406350
> 99.9%
  36
 
< 0.1%
  2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 10048
99.5%
] 50
 
0.5%
} 2
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
3847
84.4%
690
 
15.1%
» 19
 
0.4%
Initial Punctuation
ValueCountFrequency (%)
672
76.2%
192
 
21.8%
« 18
 
2.0%
Control
ValueCountFrequency (%)
106
96.4%
’ 3
 
2.7%
 1
 
0.9%
Modifier Symbol
ValueCountFrequency (%)
´ 25
65.8%
` 12
31.6%
¯ 1
 
2.6%
Format
ValueCountFrequency (%)
31
60.8%
­ 20
39.2%
Other Number
ValueCountFrequency (%)
½ 8
50.0%
¹ 8
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 19
100.0%
Line Separator
ValueCountFrequency (%)
7
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Paragraph Separator
ValueCountFrequency (%)
2
100.0%
Modifier Letter
ValueCountFrequency (%)
ʼ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 11546195
80.3%
Common 2824495
 
19.6%
Cyrillic 4587
 
< 0.1%
Greek 648
 
< 0.1%
Devanagari 77
 
< 0.1%
Telugu 30
 
< 0.1%
Hiragana 20
 
< 0.1%
Tamil 19
 
< 0.1%
Han 10
 
< 0.1%
Hangul 9
 
< 0.1%
Other values (3) 19
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1365872
11.8%
a 940505
 
8.1%
t 934766
 
8.1%
i 852552
 
7.4%
o 830911
 
7.2%
n 822601
 
7.1%
s 767854
 
6.7%
r 745312
 
6.5%
h 600810
 
5.2%
l 478816
 
4.1%
Other values (132) 3206196
27.8%
Common
ValueCountFrequency (%)
2406350
85.2%
, 133443
 
4.7%
. 124794
 
4.4%
- 35244
 
1.2%
' 31121
 
1.1%
" 11661
 
0.4%
) 10048
 
0.4%
( 10024
 
0.4%
1 9748
 
0.3%
0 8265
 
0.3%
Other values (71) 43797
 
1.6%
Cyrillic
ValueCountFrequency (%)
о 470
 
10.2%
е 404
 
8.8%
а 373
 
8.1%
н 323
 
7.0%
и 299
 
6.5%
т 265
 
5.8%
р 240
 
5.2%
с 218
 
4.8%
в 173
 
3.8%
л 161
 
3.5%
Other values (46) 1661
36.2%
Greek
ValueCountFrequency (%)
α 60
 
9.3%
ο 55
 
8.5%
τ 43
 
6.6%
ι 36
 
5.6%
η 36
 
5.6%
ν 34
 
5.2%
ε 31
 
4.8%
ρ 31
 
4.8%
π 30
 
4.6%
ς 30
 
4.6%
Other values (33) 262
40.4%
Devanagari
ValueCountFrequency (%)
11
 
14.3%
6
 
7.8%
6
 
7.8%
5
 
6.5%
4
 
5.2%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
Other values (21) 30
39.0%
Hiragana
ValueCountFrequency (%)
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
Telugu
ValueCountFrequency (%)
ి 4
13.3%
3
10.0%
3
10.0%
3
10.0%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
Other values (6) 6
20.0%
Tamil
ValueCountFrequency (%)
3
15.8%
2
10.5%
2
10.5%
2
10.5%
2
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (3) 3
15.8%
Han
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Hangul
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Thai
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Arabic
ValueCountFrequency (%)
م 2
50.0%
ہ 1
25.0%
ت 1
25.0%
Inherited
ValueCountFrequency (%)
́ 4
57.1%
̈ 3
42.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14358111
99.9%
Punctuation 7270
 
0.1%
None 5930
 
< 0.1%
Cyrillic 4587
 
< 0.1%
Devanagari 77
 
< 0.1%
Telugu 30
 
< 0.1%
Hiragana 20
 
< 0.1%
Tamil 19
 
< 0.1%
Letterlike Symbols 14
 
< 0.1%
CJK 10
 
< 0.1%
Other values (11) 41
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2406350
16.8%
e 1365872
 
9.5%
a 940505
 
6.6%
t 934766
 
6.5%
i 852552
 
5.9%
o 830911
 
5.8%
n 822601
 
5.7%
s 767854
 
5.3%
r 745312
 
5.2%
h 600810
 
4.2%
Other values (82) 4090578
28.5%
Punctuation
ValueCountFrequency (%)
3847
52.9%
881
 
12.1%
690
 
9.5%
672
 
9.2%
633
 
8.7%
303
 
4.2%
192
 
2.6%
31
 
0.4%
7
 
0.1%
5
 
0.1%
Other values (4) 9
 
0.1%
None
ValueCountFrequency (%)
é 1552
26.2%
ä 294
 
5.0%
á 293
 
4.9%
ö 250
 
4.2%
í 243
 
4.1%
è 209
 
3.5%
ü 178
 
3.0%
ı 165
 
2.8%
ó 164
 
2.8%
ç 158
 
2.7%
Other values (141) 2424
40.9%
Cyrillic
ValueCountFrequency (%)
о 470
 
10.2%
е 404
 
8.8%
а 373
 
8.1%
н 323
 
7.0%
и 299
 
6.5%
т 265
 
5.8%
р 240
 
5.2%
с 218
 
4.8%
в 173
 
3.8%
л 161
 
3.5%
Other values (46) 1661
36.2%
Letterlike Symbols
ValueCountFrequency (%)
14
100.0%
Devanagari
ValueCountFrequency (%)
11
 
14.3%
6
 
7.8%
6
 
7.8%
5
 
6.5%
4
 
5.2%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
Other values (21) 30
39.0%
Alphabetic PF
ValueCountFrequency (%)
4
100.0%
Hiragana
ValueCountFrequency (%)
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
Diacriticals
ValueCountFrequency (%)
́ 4
57.1%
̈ 3
42.9%
Telugu
ValueCountFrequency (%)
ి 4
13.3%
3
10.0%
3
10.0%
3
10.0%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
Other values (6) 6
20.0%
Tamil
ValueCountFrequency (%)
3
15.8%
2
10.5%
2
10.5%
2
10.5%
2
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (3) 3
15.8%
Arabic
ValueCountFrequency (%)
م 2
50.0%
ہ 1
25.0%
ت 1
25.0%
Hangul
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Number Forms
ValueCountFrequency (%)
2
100.0%
Modifier Letters
ValueCountFrequency (%)
ʼ 2
100.0%
Thai
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
CJK
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
100.0%
Currency Symbols
ValueCountFrequency (%)
1
50.0%
1
50.0%
Specials
ValueCountFrequency (%)
1
100.0%

Popularity
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct43731
Distinct (%)96.4%
Missing100
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean2.9264576
Minimum0
Maximum547.4883
Zeros40
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:18.792501image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.02079775
Q10.3888395
median1.1304545
Q33.6916945
95-th percentile11.063627
Maximum547.4883
Range547.4883
Interquartile range (IQR)3.302855

Descriptive statistics

Standard deviation6.0096718
Coefficient of variation (CV)2.0535653
Kurtosis1923.6882
Mean2.9264576
Median Absolute Deviation (MAD)0.9676215
Skewness29.215066
Sum132790.94
Variance36.116156
MonotonicityNot monotonic
2023-06-13T12:12:19.027496image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 × 10-656
 
0.1%
0.000308 42
 
0.1%
0 40
 
0.1%
0.00022 39
 
0.1%
0.000844 38
 
0.1%
0.001177 38
 
0.1%
0.000578 38
 
0.1%
0.002001 27
 
0.1%
0.003013 21
 
< 0.1%
0.00353 19
 
< 0.1%
Other values (43721) 45018
99.0%
(Missing) 100
 
0.2%
ValueCountFrequency (%)
0 40
0.1%
1 × 10-656
0.1%
2 × 10-66
 
< 0.1%
3 × 10-66
 
< 0.1%
4 × 10-65
 
< 0.1%
5 × 10-61
 
< 0.1%
6 × 10-62
 
< 0.1%
7 × 10-61
 
< 0.1%
8 × 10-66
 
< 0.1%
9 × 10-62
 
< 0.1%
ValueCountFrequency (%)
547.488298 1
< 0.1%
294.337037 1
< 0.1%
287.253654 1
< 0.1%
228.032744 1
< 0.1%
213.849907 1
< 0.1%
187.860492 1
< 0.1%
185.330992 1
< 0.1%
185.070892 1
< 0.1%
183.870374 1
< 0.1%
154.801009 1
< 0.1%
Distinct22667
Distinct (%)49.8%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
MissingValue
11896 
Metro-Goldwyn-Mayer (MGM)
 
742
Warner Bros.
 
540
Paramount Pictures
 
505
Twentieth Century Fox Film Corporation
 
439
Other values (22662)
31354 

Length

Max length609
Median length476
Mean length33.778894
Min length2

Characters and Unicode

Total characters1536129
Distinct characters294
Distinct categories17 ?
Distinct scripts6 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20300 ?
Unique (%)44.6%

Sample

1st rowPixar Animation Studios
2nd rowTriStar Pictures, Teitler Film, Interscope Communications
3rd rowWarner Bros., Lancaster Gate
4th rowTwentieth Century Fox Film Corporation
5th rowSandollar Productions, Touchstone Pictures

Common Values

ValueCountFrequency (%)
MissingValue 11896
 
26.2%
Metro-Goldwyn-Mayer (MGM) 742
 
1.6%
Warner Bros. 540
 
1.2%
Paramount Pictures 505
 
1.1%
Twentieth Century Fox Film Corporation 439
 
1.0%
Universal Pictures 320
 
0.7%
RKO Radio Pictures 247
 
0.5%
Columbia Pictures Corporation 207
 
0.5%
Columbia Pictures 146
 
0.3%
Mosfilm 145
 
0.3%
Other values (22657) 30289
66.6%

Length

2023-06-13T12:12:19.315362image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
missingvalue 11896
 
6.3%
films 9455
 
5.0%
pictures 9267
 
4.9%
productions 9059
 
4.8%
film 6679
 
3.5%
entertainment 5154
 
2.7%
corporation 2189
 
1.2%
company 1769
 
0.9%
warner 1478
 
0.8%
bros 1411
 
0.7%
Other values (18617) 131220
69.2%

Most occurring characters

ValueCountFrequency (%)
144110
 
9.4%
i 130730
 
8.5%
e 106540
 
6.9%
n 101865
 
6.6%
a 89039
 
5.8%
s 86459
 
5.6%
o 85292
 
5.6%
r 83547
 
5.4%
t 83433
 
5.4%
l 63160
 
4.1%
Other values (284) 561954
36.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1105978
72.0%
Uppercase Letter 222757
 
14.5%
Space Separator 144115
 
9.4%
Other Punctuation 45099
 
2.9%
Decimal Number 4347
 
0.3%
Dash Punctuation 4331
 
0.3%
Open Punctuation 4328
 
0.3%
Close Punctuation 4327
 
0.3%
Math Symbol 662
 
< 0.1%
Other Letter 140
 
< 0.1%
Other values (7) 45
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 130730
11.8%
e 106540
9.6%
n 101865
9.2%
a 89039
8.1%
s 86459
 
7.8%
o 85292
 
7.7%
r 83547
 
7.6%
t 83433
 
7.5%
l 63160
 
5.7%
u 55647
 
5.0%
Other values (102) 220266
19.9%
Other Letter
ValueCountFrequency (%)
9
 
6.4%
8
 
5.7%
6
 
4.3%
5
 
3.6%
5
 
3.6%
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
3
 
2.1%
Other values (62) 85
60.7%
Uppercase Letter
ValueCountFrequency (%)
P 27880
12.5%
F 26362
11.8%
M 25257
11.3%
C 20585
 
9.2%
V 14957
 
6.7%
S 11911
 
5.3%
E 9746
 
4.4%
A 9547
 
4.3%
T 9356
 
4.2%
B 9001
 
4.0%
Other values (52) 58155
26.1%
Other Punctuation
ValueCountFrequency (%)
, 37354
82.8%
. 5671
 
12.6%
& 764
 
1.7%
/ 645
 
1.4%
' 451
 
1.0%
" 133
 
0.3%
! 36
 
0.1%
% 18
 
< 0.1%
: 9
 
< 0.1%
@ 5
 
< 0.1%
Other values (6) 13
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 1034
23.8%
1 712
16.4%
0 641
14.7%
3 556
12.8%
4 481
11.1%
9 205
 
4.7%
6 195
 
4.5%
5 178
 
4.1%
8 173
 
4.0%
7 172
 
4.0%
Open Punctuation
ValueCountFrequency (%)
( 4318
99.8%
[ 9
 
0.2%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 4317
99.8%
] 9
 
0.2%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
144110
> 99.9%
  5
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 4329
> 99.9%
2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 661
99.8%
| 1
 
0.2%
Other Symbol
ValueCountFrequency (%)
° 23
92.0%
2
 
8.0%
Final Punctuation
ValueCountFrequency (%)
3
50.0%
» 3
50.0%
Other Number
ValueCountFrequency (%)
² 1
50.0%
½ 1
50.0%
Control
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
« 3
100.0%
Format
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1328332
86.5%
Common 207252
 
13.5%
Cyrillic 373
 
< 0.1%
Hangul 115
 
< 0.1%
Greek 31
 
< 0.1%
Han 26
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 130730
 
9.8%
e 106540
 
8.0%
n 101865
 
7.7%
a 89039
 
6.7%
s 86459
 
6.5%
o 85292
 
6.4%
r 83547
 
6.3%
t 83433
 
6.3%
l 63160
 
4.8%
u 55647
 
4.2%
Other values (99) 442620
33.3%
Hangul
ValueCountFrequency (%)
9
 
7.8%
8
 
7.0%
6
 
5.2%
5
 
4.3%
5
 
4.3%
5
 
4.3%
5
 
4.3%
5
 
4.3%
4
 
3.5%
3
 
2.6%
Other values (43) 60
52.2%
Common
ValueCountFrequency (%)
144110
69.5%
, 37354
 
18.0%
. 5671
 
2.7%
- 4329
 
2.1%
( 4318
 
2.1%
) 4317
 
2.1%
2 1034
 
0.5%
& 764
 
0.4%
1 712
 
0.3%
+ 661
 
0.3%
Other values (37) 3982
 
1.9%
Cyrillic
ValueCountFrequency (%)
и 34
 
9.1%
о 28
 
7.5%
а 26
 
7.0%
л 22
 
5.9%
н 20
 
5.4%
м 19
 
5.1%
т 17
 
4.6%
с 16
 
4.3%
е 16
 
4.3%
ь 16
 
4.3%
Other values (36) 159
42.6%
Greek
ValueCountFrequency (%)
ο 3
 
9.7%
ν 3
 
9.7%
Ε 2
 
6.5%
λ 2
 
6.5%
η 2
 
6.5%
ι 2
 
6.5%
τ 2
 
6.5%
ρ 2
 
6.5%
Κ 2
 
6.5%
έ 1
 
3.2%
Other values (10) 10
32.3%
Han
ValueCountFrequency (%)
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (9) 9
34.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1529899
99.6%
None 5711
 
0.4%
Cyrillic 373
 
< 0.1%
Hangul 113
 
< 0.1%
CJK 26
 
< 0.1%
Punctuation 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
144110
 
9.4%
i 130730
 
8.5%
e 106540
 
7.0%
n 101865
 
6.7%
a 89039
 
5.8%
s 86459
 
5.7%
o 85292
 
5.6%
r 83547
 
5.5%
t 83433
 
5.5%
l 63160
 
4.1%
Other values (77) 555724
36.3%
None
ValueCountFrequency (%)
é 3176
55.6%
ó 416
 
7.3%
á 317
 
5.6%
í 173
 
3.0%
ü 154
 
2.7%
ñ 150
 
2.6%
ô 140
 
2.5%
ä 137
 
2.4%
è 136
 
2.4%
ö 132
 
2.3%
Other values (76) 780
 
13.7%
Cyrillic
ValueCountFrequency (%)
и 34
 
9.1%
о 28
 
7.5%
а 26
 
7.0%
л 22
 
5.9%
н 20
 
5.4%
м 19
 
5.1%
т 17
 
4.6%
с 16
 
4.3%
е 16
 
4.3%
ь 16
 
4.3%
Other values (36) 159
42.6%
Hangul
ValueCountFrequency (%)
9
 
8.0%
8
 
7.1%
6
 
5.3%
5
 
4.4%
5
 
4.4%
5
 
4.4%
5
 
4.4%
5
 
4.4%
4
 
3.5%
3
 
2.7%
Other values (42) 58
51.3%
Punctuation
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
CJK
ValueCountFrequency (%)
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (9) 9
34.6%

ProductionCountries
Categorical

HIGH CARDINALITY  IMBALANCE 

Distinct2390
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
US
17846 
Missing values
6214 
GB
2235 
FR
 
1653
JP
 
1356
Other values (2385)
16172 

Length

Max length98
Median length2
Mean length4.5812077
Min length2

Characters and Unicode

Total characters208335
Distinct characters42
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1764 ?
Unique (%)3.9%

Sample

1st rowUS
2nd rowUS
3rd rowUS
4th rowUS
5th rowUS

Common Values

ValueCountFrequency (%)
US 17846
39.2%
Missing values 6214
 
13.7%
GB 2235
 
4.9%
FR 1653
 
3.6%
JP 1356
 
3.0%
IT 1029
 
2.3%
CA 840
 
1.8%
DE 749
 
1.6%
IN 735
 
1.6%
RU 734
 
1.6%
Other values (2380) 12085
26.6%

Length

2023-06-13T12:12:19.576927image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
us 21147
34.1%
values 6214
 
10.0%
missing 6214
 
10.0%
gb 4091
 
6.6%
fr 3939
 
6.4%
de 2254
 
3.6%
it 2168
 
3.5%
ca 1765
 
2.8%
jp 1648
 
2.7%
es 964
 
1.6%
Other values (154) 11529
18.6%

Most occurring characters

ValueCountFrequency (%)
S 23041
 
11.1%
U 23024
 
11.1%
s 18739
 
9.0%
16457
 
7.9%
i 12622
 
6.1%
, 10243
 
4.9%
R 6686
 
3.2%
M 6660
 
3.2%
u 6408
 
3.1%
n 6408
 
3.1%
Other values (32) 78047
37.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 105321
50.6%
Lowercase Letter 76314
36.6%
Space Separator 16457
 
7.9%
Other Punctuation 10243
 
4.9%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 23041
21.9%
U 23024
21.9%
R 6686
 
6.3%
M 6660
 
6.3%
B 4982
 
4.7%
E 4752
 
4.5%
G 4448
 
4.2%
F 4342
 
4.1%
I 4010
 
3.8%
A 3136
 
3.0%
Other values (16) 20240
19.2%
Lowercase Letter
ValueCountFrequency (%)
s 18739
24.6%
i 12622
16.5%
u 6408
 
8.4%
n 6408
 
8.4%
e 6311
 
8.3%
a 6214
 
8.1%
l 6214
 
8.1%
v 6214
 
8.1%
g 6214
 
8.1%
o 388
 
0.5%
Other values (4) 582
 
0.8%
Space Separator
ValueCountFrequency (%)
16457
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10243
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 181635
87.2%
Common 26700
 
12.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 23041
 
12.7%
U 23024
 
12.7%
s 18739
 
10.3%
i 12622
 
6.9%
R 6686
 
3.7%
M 6660
 
3.7%
u 6408
 
3.5%
n 6408
 
3.5%
e 6311
 
3.5%
a 6214
 
3.4%
Other values (30) 65522
36.1%
Common
ValueCountFrequency (%)
16457
61.6%
, 10243
38.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 208335
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S 23041
 
11.1%
U 23024
 
11.1%
s 18739
 
9.0%
16457
 
7.9%
i 12622
 
6.1%
, 10243
 
4.9%
R 6686
 
3.2%
M 6660
 
3.2%
u 6408
 
3.1%
n 6408
 
3.1%
Other values (32) 78047
37.5%

ReleaseDate
Categorical

Distinct17334
Distinct (%)38.1%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
2008-01-01
 
136
2009-01-01
 
121
2007-01-01
 
118
2005-01-01
 
111
2006-01-01
 
101
Other values (17329)
44889 

Length

Max length13
Median length10
Mean length10.006597
Min length10

Characters and Unicode

Total characters455060
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8570 ?
Unique (%)18.8%

Sample

1st row1995-10-30
2nd row1995-12-15
3rd row1995-12-22
4th row1995-12-22
5th row1995-02-10

Common Values

ValueCountFrequency (%)
2008-01-01 136
 
0.3%
2009-01-01 121
 
0.3%
2007-01-01 118
 
0.3%
2005-01-01 111
 
0.2%
2006-01-01 101
 
0.2%
NoReleaseDate 100
 
0.2%
2002-01-01 96
 
0.2%
2004-01-01 90
 
0.2%
2001-01-01 84
 
0.2%
2003-01-01 76
 
0.2%
Other values (17324) 44443
97.7%

Length

2023-06-13T12:12:19.792424image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2008-01-01 136
 
0.3%
2009-01-01 121
 
0.3%
2007-01-01 118
 
0.3%
2005-01-01 111
 
0.2%
2006-01-01 101
 
0.2%
noreleasedate 100
 
0.2%
2002-01-01 96
 
0.2%
2004-01-01 90
 
0.2%
2001-01-01 84
 
0.2%
2003-01-01 76
 
0.2%
Other values (17324) 44443
97.7%

Most occurring characters

ValueCountFrequency (%)
0 97600
21.4%
- 90752
19.9%
1 84054
18.5%
2 52803
11.6%
9 39773
8.7%
3 15435
 
3.4%
8 15279
 
3.4%
6 15021
 
3.3%
5 14836
 
3.3%
7 14289
 
3.1%
Other values (10) 15218
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 363008
79.8%
Dash Punctuation 90752
 
19.9%
Lowercase Letter 1000
 
0.2%
Uppercase Letter 300
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 97600
26.9%
1 84054
23.2%
2 52803
14.5%
9 39773
11.0%
3 15435
 
4.3%
8 15279
 
4.2%
6 15021
 
4.1%
5 14836
 
4.1%
7 14289
 
3.9%
4 13918
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
e 400
40.0%
a 200
20.0%
l 100
 
10.0%
s 100
 
10.0%
t 100
 
10.0%
o 100
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
N 100
33.3%
R 100
33.3%
D 100
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 90752
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 453760
99.7%
Latin 1300
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 97600
21.5%
- 90752
20.0%
1 84054
18.5%
2 52803
11.6%
9 39773
8.8%
3 15435
 
3.4%
8 15279
 
3.4%
6 15021
 
3.3%
5 14836
 
3.3%
7 14289
 
3.1%
Latin
ValueCountFrequency (%)
e 400
30.8%
a 200
15.4%
N 100
 
7.7%
R 100
 
7.7%
l 100
 
7.7%
s 100
 
7.7%
D 100
 
7.7%
t 100
 
7.7%
o 100
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 455060
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 97600
21.4%
- 90752
19.9%
1 84054
18.5%
2 52803
11.6%
9 39773
8.7%
3 15435
 
3.4%
8 15279
 
3.4%
6 15021
 
3.3%
5 14836
 
3.3%
7 14289
 
3.1%
Other values (10) 15218
 
3.3%

Revenue
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6863
Distinct (%)15.1%
Missing97
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean11229357
Minimum0
Maximum2.7879651 × 109
Zeros37972
Zeros (%)83.5%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:20.021455image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile48018459
Maximum2.7879651 × 109
Range2.7879651 × 109
Interquartile range (IQR)0

Descriptive statistics

Standard deviation64387893
Coefficient of variation (CV)5.7338897
Kurtosis237.09288
Mean11229357
Median Absolute Deviation (MAD)0
Skewness12.255124
Sum5.0957698 × 1011
Variance4.1458008 × 1015
MonotonicityNot monotonic
2023-06-13T12:12:20.251424image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 37972
83.5%
12000000 20
 
< 0.1%
10000000 19
 
< 0.1%
11000000 19
 
< 0.1%
2000000 18
 
< 0.1%
6000000 17
 
< 0.1%
5000000 14
 
< 0.1%
500000 13
 
< 0.1%
8000000 13
 
< 0.1%
14000000 12
 
< 0.1%
Other values (6853) 7262
 
16.0%
(Missing) 97
 
0.2%
ValueCountFrequency (%)
0 37972
83.5%
1 12
 
< 0.1%
2 3
 
< 0.1%
3 9
 
< 0.1%
4 4
 
< 0.1%
5 5
 
< 0.1%
6 2
 
< 0.1%
7 4
 
< 0.1%
8 5
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
2787965087 1
< 0.1%
2068223624 1
< 0.1%
1845034188 1
< 0.1%
1519557910 1
< 0.1%
1513528810 1
< 0.1%
1506249360 1
< 0.1%
1405403694 1
< 0.1%
1342000000 1
< 0.1%
1274219009 1
< 0.1%
1262886337 1
< 0.1%

Runtime
Real number (ℝ)

Distinct353
Distinct (%)0.8%
Missing346
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean94.181675
Minimum0
Maximum1256
Zeros1535
Zeros (%)3.4%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:20.496947image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile12
Q185
median95
Q3107
95-th percentile138
Maximum1256
Range1256
Interquartile range (IQR)22

Descriptive statistics

Standard deviation38.341059
Coefficient of variation (CV)0.4070968
Kurtosis93.925543
Mean94.181675
Median Absolute Deviation (MAD)11
Skewness4.4907363
Sum4250419
Variance1470.0368
MonotonicityNot monotonic
2023-06-13T12:12:20.756509image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
90 2549
 
5.6%
0 1535
 
3.4%
100 1470
 
3.2%
95 1410
 
3.1%
93 1214
 
2.7%
96 1104
 
2.4%
92 1079
 
2.4%
94 1062
 
2.3%
91 1055
 
2.3%
88 1030
 
2.3%
Other values (343) 31622
69.5%
ValueCountFrequency (%)
0 1535
3.4%
1 107
 
0.2%
2 33
 
0.1%
3 48
 
0.1%
4 50
 
0.1%
5 51
 
0.1%
6 72
 
0.2%
7 103
 
0.2%
8 78
 
0.2%
9 63
 
0.1%
ValueCountFrequency (%)
1256 1
< 0.1%
1140 2
< 0.1%
931 1
< 0.1%
925 1
< 0.1%
900 1
< 0.1%
877 1
< 0.1%
874 1
< 0.1%
840 2
< 0.1%
780 1
< 0.1%
720 1
< 0.1%

Tagline
Categorical

HIGH CARDINALITY  MISSING  UNIFORM 

Distinct20269
Distinct (%)99.4%
Missing25078
Missing (%)55.1%
Memory size355.4 KiB
Based on a true story.
 
7
Trust no one.
 
4
Be careful what you wish for.
 
4
-
 
4
How far would you go?
 
3
Other values (20264)
20376 

Length

Max length297
Median length204
Mean length46.999314
Min length1

Characters and Unicode

Total characters958692
Distinct characters170
Distinct categories17 ?
Distinct scripts6 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20163 ?
Unique (%)98.8%

Sample

1st rowRoll the dice and unleash the excitement!
2nd rowStill Yelling. Still Fighting. Still Ready for Love.
3rd rowFriends are the people who let you be yourself... and never let you forget it.
4th rowJust When His World Is Back To Normal... He's In For The Surprise Of His Life!
5th rowA Los Angeles Crime Saga

Common Values

ValueCountFrequency (%)
Based on a true story. 7
 
< 0.1%
Trust no one. 4
 
< 0.1%
Be careful what you wish for. 4
 
< 0.1%
- 4
 
< 0.1%
How far would you go? 3
 
< 0.1%
Drama 3
 
< 0.1%
Classic Albums 3
 
< 0.1%
There are two sides to every love story. 3
 
< 0.1%
There is no turning back 3
 
< 0.1%
Documentary 3
 
< 0.1%
Other values (20259) 20361
44.8%
(Missing) 25078
55.1%

Length

2023-06-13T12:12:21.037667image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
the 10998
 
6.3%
a 6815
 
3.9%
of 4404
 
2.5%
to 3584
 
2.1%
is 2796
 
1.6%
in 2693
 
1.5%
and 2682
 
1.5%
you 2389
 
1.4%
1582
 
0.9%
for 1523
 
0.9%
Other values (15100) 134470
77.3%

Most occurring characters

ValueCountFrequency (%)
153686
16.0%
e 94412
 
9.8%
t 57267
 
6.0%
o 56566
 
5.9%
a 51473
 
5.4%
n 47498
 
5.0%
i 46036
 
4.8%
r 44992
 
4.7%
s 42360
 
4.4%
h 37172
 
3.9%
Other values (160) 327230
34.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 680479
71.0%
Space Separator 153686
 
16.0%
Uppercase Letter 74991
 
7.8%
Other Punctuation 44585
 
4.7%
Decimal Number 2687
 
0.3%
Dash Punctuation 1944
 
0.2%
Final Punctuation 98
 
< 0.1%
Open Punctuation 56
 
< 0.1%
Close Punctuation 55
 
< 0.1%
Currency Symbol 37
 
< 0.1%
Other values (7) 74
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 94412
13.9%
t 57267
 
8.4%
o 56566
 
8.3%
a 51473
 
7.6%
n 47498
 
7.0%
i 46036
 
6.8%
r 44992
 
6.6%
s 42360
 
6.2%
h 37172
 
5.5%
l 30174
 
4.4%
Other values (43) 172529
25.4%
Other Letter
ValueCountFrequency (%)
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (24) 24
70.6%
Uppercase Letter
ValueCountFrequency (%)
T 10009
 
13.3%
A 6874
 
9.2%
S 5652
 
7.5%
H 4402
 
5.9%
I 4387
 
5.9%
E 4306
 
5.7%
W 3681
 
4.9%
O 3477
 
4.6%
N 3195
 
4.3%
L 3194
 
4.3%
Other values (20) 25814
34.4%
Other Punctuation
ValueCountFrequency (%)
. 26647
59.8%
! 5784
 
13.0%
' 5674
 
12.7%
, 4226
 
9.5%
? 1161
 
2.6%
" 582
 
1.3%
148
 
0.3%
: 138
 
0.3%
& 83
 
0.2%
* 42
 
0.1%
Other values (7) 100
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 802
29.8%
1 516
19.2%
2 299
 
11.1%
3 208
 
7.7%
9 208
 
7.7%
5 168
 
6.3%
4 140
 
5.2%
6 121
 
4.5%
7 121
 
4.5%
8 104
 
3.9%
Math Symbol
ValueCountFrequency (%)
+ 5
35.7%
= 5
35.7%
| 2
 
14.3%
~ 1
 
7.1%
1
 
7.1%
Dash Punctuation
ValueCountFrequency (%)
- 1927
99.1%
9
 
0.5%
8
 
0.4%
Final Punctuation
ValueCountFrequency (%)
82
83.7%
15
 
15.3%
» 1
 
1.0%
Initial Punctuation
ValueCountFrequency (%)
14
73.7%
4
 
21.1%
« 1
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 49
87.5%
[ 7
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 48
87.3%
] 7
 
12.7%
Other Number
ValueCountFrequency (%)
½ 2
66.7%
² 1
33.3%
Modifier Letter
ValueCountFrequency (%)
ˌ 1
50.0%
ˈ 1
50.0%
Space Separator
ValueCountFrequency (%)
153686
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 37
100.0%
Nonspacing Mark
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 755470
78.8%
Common 203187
 
21.2%
Han 21
 
< 0.1%
Tamil 5
 
< 0.1%
Hiragana 5
 
< 0.1%
Katakana 4
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 94412
 
12.5%
t 57267
 
7.6%
o 56566
 
7.5%
a 51473
 
6.8%
n 47498
 
6.3%
i 46036
 
6.1%
r 44992
 
6.0%
s 42360
 
5.6%
h 37172
 
4.9%
l 30174
 
4.0%
Other values (73) 247520
32.8%
Common
ValueCountFrequency (%)
153686
75.6%
. 26647
 
13.1%
! 5784
 
2.8%
' 5674
 
2.8%
, 4226
 
2.1%
- 1927
 
0.9%
? 1161
 
0.6%
0 802
 
0.4%
" 582
 
0.3%
1 516
 
0.3%
Other values (42) 2182
 
1.1%
Han
ValueCountFrequency (%)
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (11) 11
52.4%
Tamil
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 958262
> 99.9%
Punctuation 280
 
< 0.1%
None 110
 
< 0.1%
CJK 21
 
< 0.1%
Tamil 5
 
< 0.1%
Hiragana 5
 
< 0.1%
Katakana 4
 
< 0.1%
IPA Ext 2
 
< 0.1%
Modifier Letters 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
153686
16.0%
e 94412
 
9.9%
t 57267
 
6.0%
o 56566
 
5.9%
a 51473
 
5.4%
n 47498
 
5.0%
i 46036
 
4.8%
r 44992
 
4.7%
s 42360
 
4.4%
h 37172
 
3.9%
Other values (78) 326800
34.1%
Punctuation
ValueCountFrequency (%)
148
52.9%
82
29.3%
15
 
5.4%
14
 
5.0%
9
 
3.2%
8
 
2.9%
4
 
1.4%
None
ValueCountFrequency (%)
é 18
16.4%
ä 16
14.5%
ö 8
 
7.3%
á 6
 
5.5%
ó 6
 
5.5%
ü 5
 
4.5%
í 5
 
4.5%
ı 5
 
4.5%
· 4
 
3.6%
ć 3
 
2.7%
Other values (26) 34
30.9%
IPA Ext
ValueCountFrequency (%)
ə 2
100.0%
Tamil
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
CJK
ValueCountFrequency (%)
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (11) 11
52.4%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Modifier Letters
ValueCountFrequency (%)
ˌ 1
50.0%
ˈ 1
50.0%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

Title
Categorical

HIGH CARDINALITY  UNIFORM 

Distinct42197
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
NoTitle
 
100
Cinderella
 
11
Alice in Wonderland
 
9
Hamlet
 
9
Les Misérables
 
8
Other values (42192)
45339 

Length

Max length105
Median length79
Mean length16.680447
Min length1

Characters and Unicode

Total characters758560
Distinct characters287
Distinct categories17 ?
Distinct scripts7 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39869 ?
Unique (%)87.7%

Sample

1st rowToy Story
2nd rowJumanji
3rd rowGrumpier Old Men
4th rowWaiting to Exhale
5th rowFather of the Bride Part II

Common Values

ValueCountFrequency (%)
NoTitle 100
 
0.2%
Cinderella 11
 
< 0.1%
Alice in Wonderland 9
 
< 0.1%
Hamlet 9
 
< 0.1%
Les Misérables 8
 
< 0.1%
Beauty and the Beast 8
 
< 0.1%
Treasure Island 7
 
< 0.1%
A Christmas Carol 7
 
< 0.1%
The Three Musketeers 7
 
< 0.1%
Blackout 7
 
< 0.1%
Other values (42187) 45303
99.6%

Length

2023-06-13T12:12:21.330667image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
the 14555
 
10.7%
of 4930
 
3.6%
a 2241
 
1.6%
in 1693
 
1.2%
and 1631
 
1.2%
to 1054
 
0.8%
757
 
0.6%
man 665
 
0.5%
love 664
 
0.5%
for 601
 
0.4%
Other values (24354) 107490
78.9%

Most occurring characters

ValueCountFrequency (%)
90827
 
12.0%
e 76351
 
10.1%
a 48940
 
6.5%
o 45771
 
6.0%
n 40817
 
5.4%
r 40018
 
5.3%
i 39864
 
5.3%
t 36822
 
4.9%
s 29519
 
3.9%
h 28516
 
3.8%
Other values (277) 281115
37.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 534634
70.5%
Uppercase Letter 117465
 
15.5%
Space Separator 90827
 
12.0%
Other Punctuation 10489
 
1.4%
Decimal Number 3850
 
0.5%
Dash Punctuation 981
 
0.1%
Close Punctuation 87
 
< 0.1%
Open Punctuation 85
 
< 0.1%
Final Punctuation 38
 
< 0.1%
Other Letter 25
 
< 0.1%
Other values (7) 79
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 76351
14.3%
a 48940
9.2%
o 45771
 
8.6%
n 40817
 
7.6%
r 40018
 
7.5%
i 39864
 
7.5%
t 36822
 
6.9%
s 29519
 
5.5%
h 28516
 
5.3%
l 26024
 
4.9%
Other values (121) 121992
22.8%
Uppercase Letter
ValueCountFrequency (%)
T 16119
13.7%
S 10336
 
8.8%
M 8031
 
6.8%
B 7659
 
6.5%
C 7165
 
6.1%
A 6785
 
5.8%
D 6335
 
5.4%
L 5872
 
5.0%
H 5170
 
4.4%
W 5166
 
4.4%
Other values (65) 38827
33.1%
Other Letter
ValueCountFrequency (%)
چ 2
 
8.0%
ه 2
 
8.0%
ی 2
 
8.0%
ک 2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
ª 1
 
4.0%
Other values (11) 11
44.0%
Other Punctuation
ValueCountFrequency (%)
: 3717
35.4%
' 2505
23.9%
. 1603
15.3%
, 1134
 
10.8%
! 647
 
6.2%
& 458
 
4.4%
? 269
 
2.6%
/ 79
 
0.8%
* 19
 
0.2%
# 13
 
0.1%
Other values (8) 45
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 861
22.4%
1 697
18.1%
0 616
16.0%
3 482
12.5%
9 230
 
6.0%
4 229
 
5.9%
5 225
 
5.8%
7 193
 
5.0%
8 161
 
4.2%
6 156
 
4.1%
Math Symbol
ValueCountFrequency (%)
+ 17
70.8%
× 3
 
12.5%
1
 
4.2%
= 1
 
4.2%
1
 
4.2%
1
 
4.2%
Other Number
ValueCountFrequency (%)
½ 12
63.2%
² 3
 
15.8%
³ 2
 
10.5%
1
 
5.3%
1
 
5.3%
Other Symbol
ValueCountFrequency (%)
° 3
37.5%
2
25.0%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Currency Symbol
ValueCountFrequency (%)
$ 18
85.7%
¢ 2
 
9.5%
£ 1
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 966
98.5%
15
 
1.5%
Close Punctuation
ValueCountFrequency (%)
) 82
94.3%
] 5
 
5.7%
Open Punctuation
ValueCountFrequency (%)
( 80
94.1%
[ 5
 
5.9%
Final Punctuation
ValueCountFrequency (%)
37
97.4%
1
 
2.6%
Initial Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
90827
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Format
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 651584
85.9%
Common 106436
 
14.0%
Cyrillic 346
 
< 0.1%
Greek 170
 
< 0.1%
Arabic 11
 
< 0.1%
Katakana 8
 
< 0.1%
Han 5
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 76351
 
11.7%
a 48940
 
7.5%
o 45771
 
7.0%
n 40817
 
6.3%
r 40018
 
6.1%
i 39864
 
6.1%
t 36822
 
5.7%
s 29519
 
4.5%
h 28516
 
4.4%
l 26024
 
4.0%
Other values (107) 238942
36.7%
Common
ValueCountFrequency (%)
90827
85.3%
: 3717
 
3.5%
' 2505
 
2.4%
. 1603
 
1.5%
, 1134
 
1.1%
- 966
 
0.9%
2 861
 
0.8%
1 697
 
0.7%
! 647
 
0.6%
0 616
 
0.6%
Other values (50) 2863
 
2.7%
Cyrillic
ValueCountFrequency (%)
е 32
 
9.2%
о 32
 
9.2%
а 29
 
8.4%
н 24
 
6.9%
и 23
 
6.6%
р 22
 
6.4%
к 17
 
4.9%
с 15
 
4.3%
в 14
 
4.0%
л 14
 
4.0%
Other values (38) 124
35.8%
Greek
ValueCountFrequency (%)
α 20
 
11.8%
ι 14
 
8.2%
ο 14
 
8.2%
τ 9
 
5.3%
λ 8
 
4.7%
ά 8
 
4.7%
ρ 8
 
4.7%
ν 7
 
4.1%
π 6
 
3.5%
η 6
 
3.5%
Other values (32) 70
41.2%
Katakana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Arabic
ValueCountFrequency (%)
چ 2
18.2%
ه 2
18.2%
ی 2
18.2%
ک 2
18.2%
س 1
9.1%
ا 1
9.1%
ج 1
9.1%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 756995
99.8%
None 1124
 
0.1%
Cyrillic 346
 
< 0.1%
Punctuation 62
 
< 0.1%
Arabic 11
 
< 0.1%
Katakana 8
 
< 0.1%
CJK 5
 
< 0.1%
Misc Symbols 3
 
< 0.1%
Letterlike Symbols 2
 
< 0.1%
Math Operators 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
90827
 
12.0%
e 76351
 
10.1%
a 48940
 
6.5%
o 45771
 
6.0%
n 40817
 
5.4%
r 40018
 
5.3%
i 39864
 
5.3%
t 36822
 
4.9%
s 29519
 
3.9%
h 28516
 
3.8%
Other values (76) 279550
36.9%
None
ValueCountFrequency (%)
é 218
19.4%
ä 127
 
11.3%
ö 55
 
4.9%
è 53
 
4.7%
ô 44
 
3.9%
ü 39
 
3.5%
ó 37
 
3.3%
á 35
 
3.1%
ı 35
 
3.1%
í 33
 
2.9%
Other values (108) 448
39.9%
Punctuation
ValueCountFrequency (%)
37
59.7%
15
24.2%
5
 
8.1%
2
 
3.2%
1
 
1.6%
1
 
1.6%
1
 
1.6%
Cyrillic
ValueCountFrequency (%)
е 32
 
9.2%
о 32
 
9.2%
а 29
 
8.4%
н 24
 
6.9%
и 23
 
6.6%
р 22
 
6.4%
к 17
 
4.9%
с 15
 
4.3%
в 14
 
4.0%
л 14
 
4.0%
Other values (38) 124
35.8%
Arabic
ValueCountFrequency (%)
چ 2
18.2%
ه 2
18.2%
ی 2
18.2%
ک 2
18.2%
س 1
9.1%
ا 1
9.1%
ج 1
9.1%
Misc Symbols
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
50.0%
1
50.0%
Math Operators
ValueCountFrequency (%)
1
50.0%
1
50.0%
Katakana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Arrows
ValueCountFrequency (%)
1
100.0%

VoteAverage
Real number (ℝ)

Distinct92
Distinct (%)0.2%
Missing100
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean5.62407
Minimum0
Maximum10
Zeros2947
Zeros (%)6.5%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:21.606237image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median6
Q36.8
95-th percentile7.8
Maximum10
Range10
Interquartile range (IQR)1.8

Descriptive statistics

Standard deviation1.9154225
Coefficient of variation (CV)0.34057587
Kurtosis2.5420547
Mean5.62407
Median Absolute Deviation (MAD)0.9
Skewness-1.524472
Sum255197.8
Variance3.6688434
MonotonicityNot monotonic
2023-06-13T12:12:21.847588image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2947
 
6.5%
6 2462
 
5.4%
5 1998
 
4.4%
7 1883
 
4.1%
6.5 1722
 
3.8%
6.3 1603
 
3.5%
5.5 1381
 
3.0%
5.8 1369
 
3.0%
6.4 1350
 
3.0%
6.7 1342
 
3.0%
Other values (82) 27319
60.1%
ValueCountFrequency (%)
0 2947
6.5%
0.5 13
 
< 0.1%
0.7 1
 
< 0.1%
1 103
 
0.2%
1.1 1
 
< 0.1%
1.2 4
 
< 0.1%
1.3 13
 
< 0.1%
1.4 5
 
< 0.1%
1.5 30
 
0.1%
1.6 6
 
< 0.1%
ValueCountFrequency (%)
10 185
0.4%
9.8 1
 
< 0.1%
9.6 1
 
< 0.1%
9.5 18
 
< 0.1%
9.4 3
 
< 0.1%
9.3 18
 
< 0.1%
9.2 4
 
< 0.1%
9.1 2
 
< 0.1%
9 158
0.3%
8.9 7
 
< 0.1%

VoteCount
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1820
Distinct (%)4.0%
Missing100
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean110.09644
Minimum0
Maximum14075
Zeros2849
Zeros (%)6.3%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:22.096645image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13
median10
Q334
95-th percentile434
Maximum14075
Range14075
Interquartile range (IQR)31

Descriptive statistics

Standard deviation491.74289
Coefficient of variation (CV)4.4664741
Kurtosis150.92858
Mean110.09644
Median Absolute Deviation (MAD)8
Skewness10.440782
Sum4995736
Variance241811.07
MonotonicityNot monotonic
2023-06-13T12:12:22.339768image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 3242
 
7.1%
2 3127
 
6.9%
0 2849
 
6.3%
3 2785
 
6.1%
4 2478
 
5.4%
5 2097
 
4.6%
6 1747
 
3.8%
7 1570
 
3.5%
8 1359
 
3.0%
9 1194
 
2.6%
Other values (1810) 22928
50.4%
ValueCountFrequency (%)
0 2849
6.3%
1 3242
7.1%
2 3127
6.9%
3 2785
6.1%
4 2478
5.4%
5 2097
4.6%
6 1747
3.8%
7 1570
3.5%
8 1359
3.0%
9 1194
 
2.6%
ValueCountFrequency (%)
14075 1
< 0.1%
12269 1
< 0.1%
12114 1
< 0.1%
12000 1
< 0.1%
11444 1
< 0.1%
11187 1
< 0.1%
10297 1
< 0.1%
10014 1
< 0.1%
9678 1
< 0.1%
9634 1
< 0.1%

ReleaseYear
Real number (ℝ)

Distinct135
Distinct (%)0.3%
Missing100
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean1991.8812
Minimum1874
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:22.612648image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum1874
5-th percentile1941
Q11978
median2001
Q32010
95-th percentile2015
Maximum2020
Range146
Interquartile range (IQR)32

Descriptive statistics

Standard deviation24.05536
Coefficient of variation (CV)0.012076704
Kurtosis0.84010576
Mean1991.8812
Median Absolute Deviation (MAD)12
Skewness-1.2248636
Sum90383601
Variance578.66033
MonotonicityNot monotonic
2023-06-13T12:12:22.861645image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2014 1974
 
4.3%
2015 1905
 
4.2%
2013 1889
 
4.2%
2012 1722
 
3.8%
2011 1667
 
3.7%
2016 1604
 
3.5%
2009 1586
 
3.5%
2010 1501
 
3.3%
2008 1473
 
3.2%
2007 1320
 
2.9%
Other values (125) 28735
63.2%
ValueCountFrequency (%)
1874 1
 
< 0.1%
1878 1
 
< 0.1%
1883 1
 
< 0.1%
1887 1
 
< 0.1%
1888 2
 
< 0.1%
1890 5
 
< 0.1%
1891 6
< 0.1%
1892 3
 
< 0.1%
1893 1
 
< 0.1%
1894 13
< 0.1%
ValueCountFrequency (%)
2020 1
 
< 0.1%
2018 5
 
< 0.1%
2017 532
 
1.2%
2016 1604
3.5%
2015 1905
4.2%
2014 1974
4.3%
2013 1889
4.2%
2012 1722
3.8%
2011 1667
3.7%
2010 1501
3.3%

ReleaseMonth
Real number (ℝ)

Distinct12
Distinct (%)< 0.1%
Missing100
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean6.4590753
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:23.095645image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.6281605
Coefficient of variation (CV)0.56171515
Kurtosis-1.3247729
Mean6.4590753
Median Absolute Deviation (MAD)3
Skewness-0.071880633
Sum293087
Variance13.163548
MonotonicityNot monotonic
2023-06-13T12:12:23.276345image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 5912
13.0%
9 4838
10.6%
10 4615
10.1%
12 3786
8.3%
11 3661
8.1%
3 3553
7.8%
4 3453
7.6%
8 3394
7.5%
5 3339
7.3%
6 3153
6.9%
Other values (2) 5672
12.5%
ValueCountFrequency (%)
1 5912
13.0%
2 3032
6.7%
3 3553
7.8%
4 3453
7.6%
5 3339
7.3%
6 3153
6.9%
7 2640
5.8%
8 3394
7.5%
9 4838
10.6%
10 4615
10.1%
ValueCountFrequency (%)
12 3786
8.3%
11 3661
8.1%
10 4615
10.1%
9 4838
10.6%
8 3394
7.5%
7 2640
5.8%
6 3153
6.9%
5 3339
7.3%
4 3453
7.6%
3 3553
7.8%

Return
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct5232
Distinct (%)11.5%
Missing97
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean659.99915
Minimum0
Maximum12396383
Zeros39998
Zeros (%)88.0%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:23.504992image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2.5353413
Maximum12396383
Range12396383
Interquartile range (IQR)0

Descriptive statistics

Standard deviation74690.825
Coefficient of variation (CV)113.16806
Kurtosis20674.324
Mean659.99915
Median Absolute Deviation (MAD)0
Skewness138.3341
Sum29950101
Variance5.5787194 × 109
MonotonicityNot monotonic
2023-06-13T12:12:23.763954image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 39998
88.0%
1 20
 
< 0.1%
2 12
 
< 0.1%
4 11
 
< 0.1%
5 8
 
< 0.1%
2.5 7
 
< 0.1%
3 7
 
< 0.1%
1.333333333 7
 
< 0.1%
1.5 6
 
< 0.1%
0.25 4
 
< 0.1%
Other values (5222) 5299
 
11.7%
(Missing) 97
 
0.2%
ValueCountFrequency (%)
0 39998
88.0%
5.217391304 × 10-71
 
< 0.1%
7.5 × 10-71
 
< 0.1%
9.375 × 10-71
 
< 0.1%
1.499133126 × 10-61
 
< 0.1%
1.8 × 10-61
 
< 0.1%
1.916666667 × 10-61
 
< 0.1%
3.5 × 10-61
 
< 0.1%
4 × 10-61
 
< 0.1%
5.111111111 × 10-61
 
< 0.1%
ValueCountFrequency (%)
12396383 1
< 0.1%
8500000 1
< 0.1%
4197476.625 1
< 0.1%
2755584 1
< 0.1%
1018619.283 1
< 0.1%
1000000 1
< 0.1%
26881.72043 1
< 0.1%
12890.38667 1
< 0.1%
5330.33945 1
< 0.1%
4133.333333 1
< 0.1%

Director
Categorical

Distinct17573
Distinct (%)38.6%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
[]
 
887
['John Ford']
 
66
['Michael Curtiz']
 
65
['Werner Herzog']
 
54
['Alfred Hitchcock']
 
53
Other values (17568)
44351 

Length

Max length37
Median length33
Mean length17.164153
Min length2

Characters and Unicode

Total characters780557
Distinct characters203
Distinct categories9 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10622 ?
Unique (%)23.4%

Sample

1st row['John Lasseter']
2nd row['Joe Johnston']
3rd row['Howard Deutch']
4th row['Forest Whitaker']
5th row['Charles Shyer']

Common Values

ValueCountFrequency (%)
[] 887
 
2.0%
['John Ford'] 66
 
0.1%
['Michael Curtiz'] 65
 
0.1%
['Werner Herzog'] 54
 
0.1%
['Alfred Hitchcock'] 53
 
0.1%
['Georges Méliès'] 51
 
0.1%
['Woody Allen'] 49
 
0.1%
['Jean-Luc Godard'] 47
 
0.1%
['Sidney Lumet'] 46
 
0.1%
['Charlie Chaplin'] 44
 
0.1%
Other values (17563) 44114
97.0%

Length

2023-06-13T12:12:24.039959image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
john 1165
 
1.2%
974
 
1.0%
michael 879
 
0.9%
robert 806
 
0.9%
david 806
 
0.9%
peter 525
 
0.6%
william 513
 
0.5%
richard 511
 
0.5%
james 489
 
0.5%
paul 439
 
0.5%
Other values (17101) 87607
92.5%

Most occurring characters

ValueCountFrequency (%)
' 89011
 
11.4%
e 52337
 
6.7%
a 51706
 
6.6%
49250
 
6.3%
[ 45476
 
5.8%
] 45476
 
5.8%
r 40797
 
5.2%
n 40267
 
5.2%
i 39006
 
5.0%
o 35375
 
4.5%
Other values (193) 291856
37.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 451377
57.8%
Uppercase Letter 95412
 
12.2%
Other Punctuation 92297
 
11.8%
Space Separator 49250
 
6.3%
Open Punctuation 45478
 
5.8%
Close Punctuation 45478
 
5.8%
Dash Punctuation 1238
 
0.2%
Other Letter 21
 
< 0.1%
Decimal Number 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 52337
11.6%
a 51706
11.5%
r 40797
 
9.0%
n 40267
 
8.9%
i 39006
 
8.6%
o 35375
 
7.8%
l 27477
 
6.1%
s 20792
 
4.6%
t 19775
 
4.4%
h 16708
 
3.7%
Other values (97) 107137
23.7%
Uppercase Letter
ValueCountFrequency (%)
M 8356
 
8.8%
S 7933
 
8.3%
J 7211
 
7.6%
R 6167
 
6.5%
B 5973
 
6.3%
C 5961
 
6.2%
A 5717
 
6.0%
D 5104
 
5.3%
L 4950
 
5.2%
G 4566
 
4.8%
Other values (52) 33474
35.1%
Other Letter
ValueCountFrequency (%)
ی 2
 
9.5%
ا 2
 
9.5%
م 2
 
9.5%
ع 1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
ن 1
 
4.8%
پ 1
 
4.8%
Other values (8) 8
38.1%
Other Punctuation
ValueCountFrequency (%)
' 89011
96.4%
. 2885
 
3.1%
" 374
 
0.4%
, 14
 
< 0.1%
\ 12
 
< 0.1%
· 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 3
50.0%
9 1
 
16.7%
5 1
 
16.7%
3 1
 
16.7%
Open Punctuation
ValueCountFrequency (%)
[ 45476
> 99.9%
( 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
] 45476
> 99.9%
) 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
49250
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1238
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 546645
70.0%
Common 233747
29.9%
Cyrillic 144
 
< 0.1%
Arabic 10
 
< 0.1%
Han 8
 
< 0.1%
Hangul 3
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 52337
 
9.6%
a 51706
 
9.5%
r 40797
 
7.5%
n 40267
 
7.4%
i 39006
 
7.1%
o 35375
 
6.5%
l 27477
 
5.0%
s 20792
 
3.8%
t 19775
 
3.6%
h 16708
 
3.1%
Other values (123) 202405
37.0%
Cyrillic
ValueCountFrequency (%)
и 19
13.2%
о 11
 
7.6%
е 11
 
7.6%
л 11
 
7.6%
р 10
 
6.9%
а 10
 
6.9%
к 8
 
5.6%
н 7
 
4.9%
в 6
 
4.2%
д 6
 
4.2%
Other values (26) 45
31.2%
Common
ValueCountFrequency (%)
' 89011
38.1%
49250
21.1%
[ 45476
19.5%
] 45476
19.5%
. 2885
 
1.2%
- 1238
 
0.5%
" 374
 
0.2%
, 14
 
< 0.1%
\ 12
 
< 0.1%
0 3
 
< 0.1%
Other values (6) 8
 
< 0.1%
Han
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Arabic
ValueCountFrequency (%)
ی 2
20.0%
ا 2
20.0%
م 2
20.0%
ع 1
10.0%
ن 1
10.0%
پ 1
10.0%
د 1
10.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 776503
99.5%
None 3886
 
0.5%
Cyrillic 144
 
< 0.1%
Arabic 10
 
< 0.1%
CJK 8
 
< 0.1%
Latin Ext Additional 3
 
< 0.1%
Hangul 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
' 89011
 
11.5%
e 52337
 
6.7%
a 51706
 
6.7%
49250
 
6.3%
[ 45476
 
5.9%
] 45476
 
5.9%
r 40797
 
5.3%
n 40267
 
5.2%
i 39006
 
5.0%
o 35375
 
4.6%
Other values (57) 287802
37.1%
None
ValueCountFrequency (%)
é 916
23.6%
á 379
 
9.8%
ö 255
 
6.6%
ó 229
 
5.9%
í 228
 
5.9%
ô 153
 
3.9%
ä 149
 
3.8%
è 134
 
3.4%
ü 108
 
2.8%
ç 106
 
2.7%
Other values (69) 1229
31.6%
Cyrillic
ValueCountFrequency (%)
и 19
13.2%
о 11
 
7.6%
е 11
 
7.6%
л 11
 
7.6%
р 10
 
6.9%
а 10
 
6.9%
к 8
 
5.6%
н 7
 
4.9%
в 6
 
4.2%
д 6
 
4.2%
Other values (26) 45
31.2%
Arabic
ValueCountFrequency (%)
ی 2
20.0%
ا 2
20.0%
م 2
20.0%
ع 1
10.0%
ن 1
10.0%
پ 1
10.0%
د 1
10.0%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Latin Ext Additional
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Id
Real number (ℝ)

Distinct45432
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108346
Minimum2
Maximum469172
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size355.4 KiB
2023-06-13T12:12:24.289953image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile5419
Q126443.25
median60002.5
Q3157302
95-th percentile358552.75
Maximum469172
Range469170
Interquartile range (IQR)130858.75

Descriptive statistics

Standard deviation112443.8
Coefficient of variation (CV)1.0378214
Kurtosis0.54902668
Mean108346
Median Absolute Deviation (MAD)44528
Skewness1.2798523
Sum4.9271426 × 109
Variance1.2643607 × 1010
MonotonicityNot monotonic
2023-06-13T12:12:24.528900image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
141971 3
 
< 0.1%
298721 2
 
< 0.1%
9755 2
 
< 0.1%
10991 2
 
< 0.1%
99080 2
 
< 0.1%
152795 2
 
< 0.1%
22649 2
 
< 0.1%
18440 2
 
< 0.1%
5511 2
 
< 0.1%
132641 2
 
< 0.1%
Other values (45422) 45455
> 99.9%
ValueCountFrequency (%)
2 1
< 0.1%
3 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
ValueCountFrequency (%)
469172 1
< 0.1%
468707 1
< 0.1%
468343 1
< 0.1%
467731 1
< 0.1%
465044 1
< 0.1%
464819 1
< 0.1%
464207 1
< 0.1%
464111 1
< 0.1%
463906 1
< 0.1%
463800 1
< 0.1%

MovieCharacter
Categorical

Distinct40180
Distinct (%)88.4%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
NoCharacter
 
2570
Himself
 
516
, , ,
 
211
, , , ,
 
209
, ,
 
141
Other values (40175)
41829 

Length

Max length6647
Median length1773
Mean length168.8468
Min length2

Characters and Unicode

Total characters7678477
Distinct characters618
Distinct categories20 ?
Distinct scripts12 ?
Distinct blocks14 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39945 ?
Unique (%)87.8%

Sample

1st rowWoody (voice), Buzz Lightyear (voice), Mr. Potato Head (voice), Slinky Dog (voice), Rex (voice), Hamm (voice), Bo Peep (voice), Andy (voice), Sid (voice), Mrs. Davis (voice), Sergeant (voice), Hannah (voice), TV Announcer (voice)
2nd rowAlan Parrish, Samuel Alan Parrish / Van Pelt, Judy Sheperd, Peter Shepherd, Sarah Whittle, Nora Shepherd, Carl Bentley, Carol Anne Parrish, Alan Parrish (young), Sarah Whittle (young), Exterminator, Mrs. Thomas the Realtor, Benjamin, Caleb, Billy Jessup, Cop, Bum, Jim Shepherd, Martha Shepherd, Gun Salesman, Paramedic, Paramedic, Girl, Girl, Baker, Pianist
3rd rowMax Goldman, John Gustafson, Ariel Gustafson, Maria Sophia Coletta Ragetti, Melanie Gustafson, Grandpa Gustafson, Jacob Goldman
4th rowSavannah 'Vannah' Jackson, Bernadine 'Bernie' Harris, Gloria 'Glo' Matthews, Robin Stokes, Marvin King, Kenneth Dawkins, John Harris, Sr., Troy, Joseph, James Wheeler
5th rowGeorge Banks, Nina Banks, Franck Eggelhoffer, Annie Banks-MacKenzie, Bryan MacKenzie, Matty Banks, Howard Weinstein, John MacKenzie, Joanna MacKenzie, Dr. Megan Eisenberg, Mr. Habib, Wife Mrs. Habib

Common Values

ValueCountFrequency (%)
NoCharacter 2570
 
5.7%
Himself 516
 
1.1%
, , , 211
 
0.5%
, , , , 209
 
0.5%
, , 141
 
0.3%
, , , , , 129
 
0.3%
Narrator 124
 
0.3%
, 115
 
0.3%
, , , , , , 107
 
0.2%
, , , , , , , 85
 
0.2%
Other values (40170) 41269
90.7%

Length

2023-06-13T12:12:24.807799image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
37208
 
3.5%
uncredited 19404
 
1.8%
himself 14232
 
1.3%
voice 13783
 
1.3%
the 10319
 
1.0%
dr 6831
 
0.6%
mrs 5580
 
0.5%
man 5252
 
0.5%
mr 5177
 
0.5%
girl 4420
 
0.4%
Other values (130038) 947609
88.6%

Most occurring characters

ValueCountFrequency (%)
1028323
 
13.4%
e 649946
 
8.5%
, 525726
 
6.8%
a 519515
 
6.8%
r 472136
 
6.1%
i 419713
 
5.5%
n 404639
 
5.3%
o 360973
 
4.7%
t 288080
 
3.8%
l 273828
 
3.6%
Other values (608) 2735598
35.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4977433
64.8%
Space Separator 1028323
 
13.4%
Uppercase Letter 950790
 
12.4%
Other Punctuation 608315
 
7.9%
Open Punctuation 42194
 
0.5%
Close Punctuation 42154
 
0.5%
Decimal Number 14368
 
0.2%
Dash Punctuation 13905
 
0.2%
Other Letter 632
 
< 0.1%
Final Punctuation 141
 
< 0.1%
Other values (10) 222
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
º 25
 
4.0%
ا 25
 
4.0%
ل 17
 
2.7%
ي 14
 
2.2%
ب 12
 
1.9%
د 9
 
1.4%
ל 9
 
1.4%
ر 9
 
1.4%
و 8
 
1.3%
س 8
 
1.3%
Other values (274) 496
78.5%
Lowercase Letter
ValueCountFrequency (%)
e 649946
13.1%
a 519515
10.4%
r 472136
9.5%
i 419713
 
8.4%
n 404639
 
8.1%
o 360973
 
7.3%
t 288080
 
5.8%
l 273828
 
5.5%
s 242202
 
4.9%
d 174401
 
3.5%
Other values (150) 1172000
23.5%
Uppercase Letter
ValueCountFrequency (%)
M 93697
 
9.9%
S 80372
 
8.5%
C 75971
 
8.0%
B 61447
 
6.5%
D 57096
 
6.0%
H 55392
 
5.8%
P 52540
 
5.5%
A 50399
 
5.3%
G 44764
 
4.7%
L 44614
 
4.7%
Other values (95) 334498
35.2%
Other Punctuation
ValueCountFrequency (%)
, 525726
86.4%
. 34275
 
5.6%
' 26738
 
4.4%
/ 10433
 
1.7%
# 6037
 
1.0%
" 4257
 
0.7%
: 445
 
0.1%
& 268
 
< 0.1%
! 45
 
< 0.1%
? 31
 
< 0.1%
Other values (6) 60
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 5053
35.2%
2 4015
27.9%
3 1445
 
10.1%
4 748
 
5.2%
0 647
 
4.5%
9 568
 
4.0%
5 514
 
3.6%
8 490
 
3.4%
6 458
 
3.2%
7 430
 
3.0%
Nonspacing Mark
ValueCountFrequency (%)
́ 3
21.4%
̂ 3
21.4%
2
14.3%
̀ 1
 
7.1%
ּ 1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 42047
99.7%
[ 121
 
0.3%
23
 
0.1%
3
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 42032
99.7%
] 121
 
0.3%
1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 13876
99.8%
28
 
0.2%
1
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
83
58.9%
» 47
33.3%
11
 
7.8%
Initial Punctuation
ValueCountFrequency (%)
« 47
54.0%
33
37.9%
7
 
8.0%
Other Symbol
ValueCountFrequency (%)
° 24
92.3%
1
 
3.8%
® 1
 
3.8%
Math Symbol
ValueCountFrequency (%)
| 6
50.0%
+ 5
41.7%
< 1
 
8.3%
Modifier Symbol
ValueCountFrequency (%)
` 28
59.6%
´ 19
40.4%
Currency Symbol
ValueCountFrequency (%)
$ 13
86.7%
¢ 2
 
13.3%
Control
ValueCountFrequency (%)
8
88.9%
’ 1
 
11.1%
Format
ValueCountFrequency (%)
2
66.7%
­ 1
33.3%
Other Number
ValueCountFrequency (%)
½ 1
50.0%
² 1
50.0%
Space Separator
ValueCountFrequency (%)
1028323
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5913946
77.0%
Common 1749608
 
22.8%
Cyrillic 14096
 
0.2%
Hangul 223
 
< 0.1%
Greek 212
 
< 0.1%
Arabic 156
 
< 0.1%
Han 117
 
< 0.1%
Hebrew 60
 
< 0.1%
Thai 26
 
< 0.1%
Katakana 23
 
< 0.1%
Other values (2) 10
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 649946
 
11.0%
a 519515
 
8.8%
r 472136
 
8.0%
i 419713
 
7.1%
n 404639
 
6.8%
o 360973
 
6.1%
t 288080
 
4.9%
l 273828
 
4.6%
s 242202
 
4.1%
d 174401
 
2.9%
Other values (150) 2108513
35.7%
Hangul
ValueCountFrequency (%)
7
 
3.1%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
Other values (113) 173
77.6%
Han
ValueCountFrequency (%)
5
 
4.3%
4
 
3.4%
4
 
3.4%
3
 
2.6%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
Other values (77) 89
76.1%
Cyrillic
ValueCountFrequency (%)
а 1497
 
10.6%
о 1125
 
8.0%
и 1040
 
7.4%
е 968
 
6.9%
н 924
 
6.6%
р 909
 
6.4%
т 631
 
4.5%
к 613
 
4.3%
л 600
 
4.3%
в 547
 
3.9%
Other values (55) 5242
37.2%
Common
ValueCountFrequency (%)
1028323
58.8%
, 525726
30.0%
( 42047
 
2.4%
) 42032
 
2.4%
. 34275
 
2.0%
' 26738
 
1.5%
- 13876
 
0.8%
/ 10433
 
0.6%
# 6037
 
0.3%
1 5053
 
0.3%
Other values (50) 15068
 
0.9%
Greek
ValueCountFrequency (%)
α 24
 
11.3%
ς 19
 
9.0%
ο 19
 
9.0%
ρ 14
 
6.6%
σ 9
 
4.2%
τ 8
 
3.8%
η 8
 
3.8%
ν 8
 
3.8%
ά 8
 
3.8%
λ 8
 
3.8%
Other values (32) 87
41.0%
Arabic
ValueCountFrequency (%)
ا 25
16.0%
ل 17
10.9%
ي 14
 
9.0%
ب 12
 
7.7%
د 9
 
5.8%
ر 9
 
5.8%
و 8
 
5.1%
س 8
 
5.1%
ن 7
 
4.5%
ش 6
 
3.8%
Other values (17) 41
26.3%
Hebrew
ValueCountFrequency (%)
ל 9
15.0%
א 7
11.7%
ו 7
11.7%
ה 5
8.3%
י 5
8.3%
ר 4
 
6.7%
ם 4
 
6.7%
ט 3
 
5.0%
ש 3
 
5.0%
נ 2
 
3.3%
Other values (9) 11
18.3%
Thai
ValueCountFrequency (%)
3
11.5%
3
11.5%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%
Katakana
ValueCountFrequency (%)
4
17.4%
4
17.4%
2
8.7%
2
8.7%
2
8.7%
2
8.7%
2
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (2) 2
8.7%
Inherited
ValueCountFrequency (%)
́ 3
42.9%
̂ 3
42.9%
̀ 1
 
14.3%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7645701
99.6%
None 17870
 
0.2%
Cyrillic 14096
 
0.2%
Hangul 223
 
< 0.1%
Punctuation 191
 
< 0.1%
Arabic 156
 
< 0.1%
CJK 117
 
< 0.1%
Hebrew 60
 
< 0.1%
Thai 26
 
< 0.1%
Katakana 23
 
< 0.1%
Other values (4) 14
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1028323
 
13.4%
e 649946
 
8.5%
, 525726
 
6.9%
a 519515
 
6.8%
r 472136
 
6.2%
i 419713
 
5.5%
n 404639
 
5.3%
o 360973
 
4.7%
t 288080
 
3.8%
l 273828
 
3.6%
Other values (80) 2702822
35.4%
None
ValueCountFrequency (%)
é 4956
27.7%
è 1621
 
9.1%
ä 1166
 
6.5%
á 1004
 
5.6%
í 921
 
5.2%
ö 822
 
4.6%
ô 711
 
4.0%
ü 700
 
3.9%
ó 595
 
3.3%
ç 511
 
2.9%
Other values (149) 4863
27.2%
Cyrillic
ValueCountFrequency (%)
а 1497
 
10.6%
о 1125
 
8.0%
и 1040
 
7.4%
е 968
 
6.9%
н 924
 
6.6%
р 909
 
6.4%
т 631
 
4.5%
к 613
 
4.3%
л 600
 
4.3%
в 547
 
3.9%
Other values (55) 5242
37.2%
Punctuation
ValueCountFrequency (%)
83
43.5%
33
 
17.3%
28
 
14.7%
23
 
12.0%
11
 
5.8%
7
 
3.7%
3
 
1.6%
2
 
1.0%
1
 
0.5%
Arabic
ValueCountFrequency (%)
ا 25
16.0%
ل 17
10.9%
ي 14
 
9.0%
ب 12
 
7.7%
د 9
 
5.8%
ر 9
 
5.8%
و 8
 
5.1%
س 8
 
5.1%
ن 7
 
4.5%
ش 6
 
3.8%
Other values (17) 41
26.3%
Hebrew
ValueCountFrequency (%)
ל 9
15.0%
א 7
11.7%
ו 7
11.7%
ה 5
8.3%
י 5
8.3%
ר 4
 
6.7%
ם 4
 
6.7%
ט 3
 
5.0%
ש 3
 
5.0%
נ 2
 
3.3%
Other values (9) 11
18.3%
Hangul
ValueCountFrequency (%)
7
 
3.1%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
Other values (113) 173
77.6%
CJK
ValueCountFrequency (%)
5
 
4.3%
4
 
3.4%
4
 
3.4%
3
 
2.6%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
2
 
1.7%
Other values (77) 89
76.1%
Katakana
ValueCountFrequency (%)
4
17.4%
4
17.4%
2
8.7%
2
8.7%
2
8.7%
2
8.7%
2
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (2) 2
8.7%
Diacriticals
ValueCountFrequency (%)
́ 3
42.9%
̂ 3
42.9%
̀ 1
 
14.3%
Thai
ValueCountFrequency (%)
3
11.5%
3
11.5%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Latin Ext Additional
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

ActorName
Categorical

Distinct42678
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size355.4 KiB
NoName
 
2418
Georges Méliès
 
24
Louis Theroux
 
15
Mel Blanc
 
12
Jimmy Carr
 
9
Other values (42673)
42998 

Length

Max length4551
Median length1414
Mean length187.78079
Min length4

Characters and Unicode

Total characters8539519
Distinct characters395
Distinct categories16 ?
Distinct scripts9 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42472 ?
Unique (%)93.4%

Sample

1st rowTom Hanks, Tim Allen, Don Rickles, Jim Varney, Wallace Shawn, John Ratzenberger, Annie Potts, John Morris, Erik von Detten, Laurie Metcalf, R. Lee Ermey, Sarah Freeman, Penn Jillette
2nd rowRobin Williams, Jonathan Hyde, Kirsten Dunst, Bradley Pierce, Bonnie Hunt, Bebe Neuwirth, David Alan Grier, Patricia Clarkson, Adam Hann-Byrd, Laura Bell Bundy, James Handy, Gillian Barber, Brandon Obray, Cyrus Thiedeke, Gary Joseph Thorup, Leonard Zola, Lloyd Berry, Malcolm Stewart, Annabel Kershaw, Darryl Henriques, Robyn Driscoll, Peter Bryant, Sarah Gilson, Florica Vlad, June Lion, Brenda Lockmuller
3rd rowWalter Matthau, Jack Lemmon, Ann-Margret, Sophia Loren, Daryl Hannah, Burgess Meredith, Kevin Pollak
4th rowWhitney Houston, Angela Bassett, Loretta Devine, Lela Rochon, Gregory Hines, Dennis Haysbert, Michael Beach, Mykelti Williamson, Lamont Johnson, Wesley Snipes
5th rowSteve Martin, Diane Keaton, Martin Short, Kimberly Williams-Paisley, George Newbern, Kieran Culkin, BD Wong, Peter Michael Goetz, Kate McGregor-Stewart, Jane Adams, Eugene Levy, Lori Alan

Common Values

ValueCountFrequency (%)
NoName 2418
 
5.3%
Georges Méliès 24
 
0.1%
Louis Theroux 15
 
< 0.1%
Mel Blanc 12
 
< 0.1%
Jimmy Carr 9
 
< 0.1%
Werner Herzog 8
 
< 0.1%
Louis C.K. 8
 
< 0.1%
George Carlin 8
 
< 0.1%
David Attenborough 8
 
< 0.1%
Trevor Noah 6
 
< 0.1%
Other values (42668) 42960
94.5%

Length

2023-06-13T12:12:25.106645image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
john 9809
 
0.8%
michael 7464
 
0.6%
david 6190
 
0.5%
robert 5725
 
0.5%
james 5693
 
0.5%
richard 4446
 
0.4%
paul 4320
 
0.4%
peter 3903
 
0.3%
william 3432
 
0.3%
george 3416
 
0.3%
Other values (112949) 1113662
95.3%

Most occurring characters

ValueCountFrequency (%)
1122712
 
13.1%
a 707750
 
8.3%
e 668087
 
7.8%
n 524439
 
6.1%
, 519745
 
6.1%
r 497639
 
5.8%
i 484270
 
5.7%
o 426429
 
5.0%
l 366664
 
4.3%
s 256009
 
3.0%
Other values (385) 2965775
34.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5663898
66.3%
Uppercase Letter 1195912
 
14.0%
Space Separator 1122715
 
13.1%
Other Punctuation 542058
 
6.3%
Dash Punctuation 14112
 
0.2%
Other Letter 543
 
< 0.1%
Decimal Number 94
 
< 0.1%
Final Punctuation 83
 
< 0.1%
Initial Punctuation 23
 
< 0.1%
Open Punctuation 23
 
< 0.1%
Other values (6) 58
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 707750
12.5%
e 668087
11.8%
n 524439
9.3%
r 497639
 
8.8%
i 484270
 
8.6%
o 426429
 
7.5%
l 366664
 
6.5%
s 256009
 
4.5%
t 253361
 
4.5%
h 198021
 
3.5%
Other values (138) 1281229
22.6%
Other Letter
ValueCountFrequency (%)
ا 32
 
5.9%
م 31
 
5.7%
ع 19
 
3.5%
ی 19
 
3.5%
ن 18
 
3.3%
17
 
3.1%
ر 17
 
3.1%
د 17
 
3.1%
ي 16
 
2.9%
12
 
2.2%
Other values (104) 345
63.5%
Uppercase Letter
ValueCountFrequency (%)
M 109410
 
9.1%
S 92377
 
7.7%
C 84052
 
7.0%
J 83374
 
7.0%
B 82422
 
6.9%
A 70859
 
5.9%
R 67418
 
5.6%
D 65916
 
5.5%
L 61183
 
5.1%
G 54690
 
4.6%
Other values (81) 424211
35.5%
Decimal Number
ValueCountFrequency (%)
5 37
39.4%
0 29
30.9%
2 8
 
8.5%
1 8
 
8.5%
9 4
 
4.3%
4 2
 
2.1%
3 2
 
2.1%
7 2
 
2.1%
6 1
 
1.1%
8 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 519745
95.9%
. 16060
 
3.0%
' 6097
 
1.1%
" 129
 
< 0.1%
· 9
 
< 0.1%
: 6
 
< 0.1%
& 6
 
< 0.1%
! 5
 
< 0.1%
/ 1
 
< 0.1%
Nonspacing Mark
ValueCountFrequency (%)
́ 10
58.8%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Final Punctuation
ValueCountFrequency (%)
74
89.2%
6
 
7.2%
» 3
 
3.6%
Space Separator
ValueCountFrequency (%)
1122712
> 99.9%
  3
 
< 0.1%
Initial Punctuation
ValueCountFrequency (%)
20
87.0%
« 3
 
13.0%
Open Punctuation
ValueCountFrequency (%)
14
60.9%
( 9
39.1%
Format
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 14112
100.0%
Control
ValueCountFrequency (%)
21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 6856726
80.3%
Common 1679148
 
19.7%
Cyrillic 3070
 
< 0.1%
Han 276
 
< 0.1%
Arabic 241
 
< 0.1%
Thai 27
 
< 0.1%
Greek 14
 
< 0.1%
Inherited 11
 
< 0.1%
Hangul 6
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 707750
 
10.3%
e 668087
 
9.7%
n 524439
 
7.6%
r 497639
 
7.3%
i 484270
 
7.1%
o 426429
 
6.2%
l 366664
 
5.3%
s 256009
 
3.7%
t 253361
 
3.7%
h 198021
 
2.9%
Other values (163) 2474057
36.1%
Han
ValueCountFrequency (%)
17
 
6.2%
12
 
4.3%
11
 
4.0%
11
 
4.0%
11
 
4.0%
11
 
4.0%
11
 
4.0%
11
 
4.0%
9
 
3.3%
9
 
3.3%
Other values (55) 163
59.1%
Cyrillic
ValueCountFrequency (%)
а 323
 
10.5%
и 315
 
10.3%
о 233
 
7.6%
н 229
 
7.5%
р 215
 
7.0%
е 174
 
5.7%
л 155
 
5.0%
к 136
 
4.4%
т 115
 
3.7%
с 109
 
3.6%
Other values (51) 1066
34.7%
Common
ValueCountFrequency (%)
1122712
66.9%
, 519745
31.0%
. 16060
 
1.0%
- 14112
 
0.8%
' 6097
 
0.4%
" 129
 
< 0.1%
74
 
< 0.1%
5 37
 
< 0.1%
0 29
 
< 0.1%
21
 
< 0.1%
Other values (24) 132
 
< 0.1%
Arabic
ValueCountFrequency (%)
ا 32
13.3%
م 31
12.9%
ع 19
 
7.9%
ی 19
 
7.9%
ن 18
 
7.5%
ر 17
 
7.1%
د 17
 
7.1%
ي 16
 
6.6%
ل 9
 
3.7%
ب 8
 
3.3%
Other values (18) 55
22.8%
Thai
ValueCountFrequency (%)
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (11) 11
40.7%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Greek
ValueCountFrequency (%)
ν 6
42.9%
Ζ 2
 
14.3%
α 2
 
14.3%
ί 2
 
14.3%
ο 2
 
14.3%
Inherited
ValueCountFrequency (%)
́ 10
90.9%
1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8497424
99.5%
None 38289
 
0.4%
Cyrillic 3070
 
< 0.1%
CJK 276
 
< 0.1%
Arabic 241
 
< 0.1%
Punctuation 120
 
< 0.1%
Latin Ext Additional 56
 
< 0.1%
Thai 27
 
< 0.1%
Diacriticals 10
 
< 0.1%
Hangul 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1122712
 
13.2%
a 707750
 
8.3%
e 668087
 
7.9%
n 524439
 
6.2%
, 519745
 
6.1%
r 497639
 
5.9%
i 484270
 
5.7%
o 426429
 
5.0%
l 366664
 
4.3%
s 256009
 
3.0%
Other values (66) 2923680
34.4%
None
ValueCountFrequency (%)
é 9088
23.7%
á 4156
 
10.9%
í 2756
 
7.2%
ô 2332
 
6.1%
ö 2025
 
5.3%
ó 1882
 
4.9%
ü 1495
 
3.9%
ć 1360
 
3.6%
è 1243
 
3.2%
ä 996
 
2.6%
Other values (111) 10956
28.6%
Cyrillic
ValueCountFrequency (%)
а 323
 
10.5%
и 315
 
10.3%
о 233
 
7.6%
н 229
 
7.5%
р 215
 
7.0%
е 174
 
5.7%
л 155
 
5.0%
к 136
 
4.4%
т 115
 
3.7%
с 109
 
3.6%
Other values (51) 1066
34.7%
Punctuation
ValueCountFrequency (%)
74
61.7%
20
 
16.7%
14
 
11.7%
6
 
5.0%
5
 
4.2%
1
 
0.8%
Arabic
ValueCountFrequency (%)
ا 32
13.3%
م 31
12.9%
ع 19
 
7.9%
ی 19
 
7.9%
ن 18
 
7.5%
ر 17
 
7.1%
د 17
 
7.1%
ي 16
 
6.6%
ل 9
 
3.7%
ب 8
 
3.3%
Other values (18) 55
22.8%
CJK
ValueCountFrequency (%)
17
 
6.2%
12
 
4.3%
11
 
4.0%
11
 
4.0%
11
 
4.0%
11
 
4.0%
11
 
4.0%
11
 
4.0%
9
 
3.3%
9
 
3.3%
Other values (55) 163
59.1%
Latin Ext Additional
ValueCountFrequency (%)
15
26.8%
9
16.1%
6
 
10.7%
6
 
10.7%
ế 5
 
8.9%
4
 
7.1%
4
 
7.1%
4
 
7.1%
2
 
3.6%
1
 
1.8%
Diacriticals
ValueCountFrequency (%)
́ 10
100.0%
Thai
ValueCountFrequency (%)
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (11) 11
40.7%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Interactions

2023-06-13T12:12:12.608416image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:52.223184image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:54.496551image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:56.688632image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:59.160602image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:01.333869image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:03.465710image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:05.697524image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:08.138786image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:10.452160image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:12.823418image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:52.462171image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:54.722520image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:56.912584image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:59.399608image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:01.560373image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:03.735752image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:06.112524image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:08.368786image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:10.691941image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:13.015416image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:52.683204image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:54.927534image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:57.117495image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:59.613601image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:01.767371image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:03.953151image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:06.341909image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:08.580815image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:10.911979image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:13.215193image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:52.910186image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:55.154441image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:57.328184image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:59.827609image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:01.975372image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:04.175149image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:06.566880image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:08.789815image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:11.127941image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:13.416280image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:53.131551image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:55.393447image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:57.711607image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:00.043602image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:02.183867image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:04.387149image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:06.806877image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:09.173781image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:11.340569image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:13.622007image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:53.353516image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:55.623444image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:57.918608image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:00.244602image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:02.387871image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:04.599564image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:07.028759image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:09.376818image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:11.536102image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:13.839721image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:53.594625image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:55.845494image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:58.138607image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:00.469635image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:02.616866image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:04.819471image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:07.265726image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:09.595784image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:11.754532image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:14.056747image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:53.837665image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:56.067475image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:58.370630image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:00.695602image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:02.844041image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:05.049470image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:07.493529image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:09.822330image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:11.976592image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:14.263413image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:54.061518image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:56.270981image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:58.682604image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:00.909866image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:03.045707image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:05.264471image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:07.705883image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:10.027159image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:12.193557image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:14.477510image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:54.285515image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:56.487621image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:11:58.955603image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:01.126868image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:03.264722image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:05.490530image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:07.928420image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:10.250255image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
2023-06-13T12:12:12.407766image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/

Correlations

2023-06-13T12:12:25.331646image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
BudgetPopularityRevenueRuntimeVoteAverageVoteCountReleaseYearReleaseMonthReturnIdOriginalLanguage
Budget1.0000.4630.6440.2270.0720.4840.1410.0470.775-0.1860.000
Popularity0.4631.0000.4910.3070.2410.8930.1860.0720.447-0.2790.000
Revenue0.6440.4911.0000.2540.1270.5130.1040.0480.853-0.2180.000
Runtime0.2270.3070.2541.0000.1930.2900.0340.0720.234-0.1610.111
VoteAverage0.0720.2410.1270.1931.0000.318-0.0090.0480.120-0.1200.070
VoteCount0.4840.8930.5130.2900.3181.0000.1970.0630.474-0.2830.000
ReleaseYear0.1410.1860.1040.034-0.0090.1971.000-0.0140.0870.2210.145
ReleaseMonth0.0470.0720.0480.0720.0480.063-0.0141.0000.048-0.0290.047
Return0.7750.4470.8530.2340.1200.4740.0870.0481.000-0.2000.000
Id-0.186-0.279-0.218-0.161-0.120-0.2830.221-0.029-0.2001.0000.046
OriginalLanguage0.0000.0000.0000.1110.0700.0000.1450.0470.0000.0461.000

Missing values

2023-06-13T12:12:14.882539image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
A simple visualization of nullity by column.
2023-06-13T12:12:15.594611image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-06-13T12:12:16.245623image/svg+xmlMatplotlib v3.7.1, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

BudgetGenresOriginalLanguageOverviewPopularityProductionCompaniesProductionCountriesReleaseDateRevenueRuntimeTaglineTitleVoteAverageVoteCountReleaseYearReleaseMonthReturnDirectorIdMovieCharacterActorName
030000000.0Animation, Comedy, FamilyenLed by Woody, Andy's toys live happily in his room until Andy's birthday brings Buzz Lightyear onto the scene. Afraid of losing his place in Andy's heart, Woody plots against Buzz. But when circumstances separate Buzz and Woody from their owner, the duo eventually learns to put aside their differences.21.946943Pixar Animation StudiosUS1995-10-30373554033.081.0NaNToy Story7.75415.01995.010.012.451801['John Lasseter']862Woody (voice), Buzz Lightyear (voice), Mr. Potato Head (voice), Slinky Dog (voice), Rex (voice), Hamm (voice), Bo Peep (voice), Andy (voice), Sid (voice), Mrs. Davis (voice), Sergeant (voice), Hannah (voice), TV Announcer (voice)Tom Hanks, Tim Allen, Don Rickles, Jim Varney, Wallace Shawn, John Ratzenberger, Annie Potts, John Morris, Erik von Detten, Laurie Metcalf, R. Lee Ermey, Sarah Freeman, Penn Jillette
165000000.0Adventure, Fantasy, FamilyenWhen siblings Judy and Peter discover an enchanted board game that opens the door to a magical world, they unwittingly invite Alan -- an adult who's been trapped inside the game for 26 years -- into their living room. Alan's only hope for freedom is to finish the game, which proves risky as all three find themselves running from giant rhinoceroses, evil monkeys and other terrifying creatures.17.015539TriStar Pictures, Teitler Film, Interscope CommunicationsUS1995-12-15262797249.0104.0Roll the dice and unleash the excitement!Jumanji6.92413.01995.012.04.043035['Joe Johnston']8844Alan Parrish, Samuel Alan Parrish / Van Pelt, Judy Sheperd, Peter Shepherd, Sarah Whittle, Nora Shepherd, Carl Bentley, Carol Anne Parrish, Alan Parrish (young), Sarah Whittle (young), Exterminator, Mrs. Thomas the Realtor, Benjamin, Caleb, Billy Jessup, Cop, Bum, Jim Shepherd, Martha Shepherd, Gun Salesman, Paramedic, Paramedic, Girl, Girl, Baker, PianistRobin Williams, Jonathan Hyde, Kirsten Dunst, Bradley Pierce, Bonnie Hunt, Bebe Neuwirth, David Alan Grier, Patricia Clarkson, Adam Hann-Byrd, Laura Bell Bundy, James Handy, Gillian Barber, Brandon Obray, Cyrus Thiedeke, Gary Joseph Thorup, Leonard Zola, Lloyd Berry, Malcolm Stewart, Annabel Kershaw, Darryl Henriques, Robyn Driscoll, Peter Bryant, Sarah Gilson, Florica Vlad, June Lion, Brenda Lockmuller
20.0Romance, ComedyenA family wedding reignites the ancient feud between next-door neighbors and fishing buddies John and Max. Meanwhile, a sultry Italian divorcée opens a restaurant at the local bait shop, alarming the locals who worry she'll scare the fish away. But she's less interested in seafood than she is in cooking up a hot time with Max.11.712900Warner Bros., Lancaster GateUS1995-12-220.0101.0Still Yelling. Still Fighting. Still Ready for Love.Grumpier Old Men6.592.01995.012.00.000000['Howard Deutch']15602Max Goldman, John Gustafson, Ariel Gustafson, Maria Sophia Coletta Ragetti, Melanie Gustafson, Grandpa Gustafson, Jacob GoldmanWalter Matthau, Jack Lemmon, Ann-Margret, Sophia Loren, Daryl Hannah, Burgess Meredith, Kevin Pollak
316000000.0Comedy, Drama, RomanceenCheated on, mistreated and stepped on, the women are holding their breath, waiting for the elusive "good man" to break a string of less-than-stellar lovers. Friends and confidants Vannah, Bernie, Glo and Robin talk it all out, determined to find a better way to breathe.3.859495Twentieth Century Fox Film CorporationUS1995-12-2281452156.0127.0Friends are the people who let you be yourself... and never let you forget it.Waiting to Exhale6.134.01995.012.05.090760['Forest Whitaker']31357Savannah 'Vannah' Jackson, Bernadine 'Bernie' Harris, Gloria 'Glo' Matthews, Robin Stokes, Marvin King, Kenneth Dawkins, John Harris, Sr., Troy, Joseph, James WheelerWhitney Houston, Angela Bassett, Loretta Devine, Lela Rochon, Gregory Hines, Dennis Haysbert, Michael Beach, Mykelti Williamson, Lamont Johnson, Wesley Snipes
40.0ComedyenJust when George Banks has recovered from his daughter's wedding, he receives the news that she's pregnant ... and that George's wife, Nina, is expecting too. He was planning on selling their home, but that's a plan that -- like George -- will have to change with the arrival of both a grandchild and a kid of his own.8.387519Sandollar Productions, Touchstone PicturesUS1995-02-1076578911.0106.0Just When His World Is Back To Normal... He's In For The Surprise Of His Life!Father of the Bride Part II5.7173.01995.02.00.000000['Charles Shyer']11862George Banks, Nina Banks, Franck Eggelhoffer, Annie Banks-MacKenzie, Bryan MacKenzie, Matty Banks, Howard Weinstein, John MacKenzie, Joanna MacKenzie, Dr. Megan Eisenberg, Mr. Habib, Wife Mrs. HabibSteve Martin, Diane Keaton, Martin Short, Kimberly Williams-Paisley, George Newbern, Kieran Culkin, BD Wong, Peter Michael Goetz, Kate McGregor-Stewart, Jane Adams, Eugene Levy, Lori Alan
560000000.0Action, Crime, Drama, ThrillerenObsessive master thief, Neil McCauley leads a top-notch crew on various insane heists throughout Los Angeles while a mentally unstable detective, Vincent Hanna pursues him without rest. Each man recognizes and respects the ability and the dedication of the other even though they are aware their cat-and-mouse game may end in violence.17.924927Regency Enterprises, Forward Pass, Warner Bros.US1995-12-15187436818.0170.0A Los Angeles Crime SagaHeat7.71886.01995.012.03.123947['Michael Mann']949Lt. Vincent Hanna, Neil McCauley, Chris Shiherlis, Nate, Michael Cheritto, Justine Hanna, Eady, Charlene Shiherlis, Sergeant Drucker, Lauren Gustafson, Bosko, Kelso, Richard Torena, Alan Marciano, Detective Casals, Donald Breedan, Trejo, Hugh Benny, Roger Van Zant, Waingro, Elaine Cheritto, Schwartz, Albert Torena, Dr. Bob, Ralph, Anna Trejo, Armoured Guard, Hooker's Mother, Timmons, Shooter at Drive-in, Driver at Drive-in, Officer Bruce, Claudia, Bosko's Date, Sergeant Heinz, Rachel, Captain Jackson, Harry Dieter, Bank Guard, Armoured Truck Driver, Hostage Girl, 1st SIS Detective in the hallway (uncredited), Solenko, Restaurant Manager (uncredited), Castilian Woman (uncredited), Lillian, Construction Clerk, Children's Hospital Doctor, Dominick, Bartender, Casals' Date, Marcia Drucker, Armoured Guard, Basketball Player, Children's Hospital Nurse, Detective, Prostitute, Bar Couple (uncredited), Restaurant Patron (uncredited), Police Woman (uncredited), Grocery Store Employee (uncredited), Cusamano (uncredited), Grocery Store Cop (uncredited), Waitress (uncredited), Bank Guard (uncredited), Ellis (uncredited)Al Pacino, Robert De Niro, Val Kilmer, Jon Voight, Tom Sizemore, Diane Venora, Amy Brenneman, Ashley Judd, Mykelti Williamson, Natalie Portman, Ted Levine, Tom Noonan, Tone Loc, Hank Azaria, Wes Studi, Dennis Haysbert, Danny Trejo, Henry Rollins, William Fichtner, Kevin Gage, Susan Traylor, Jerry Trimble, Ricky Harris, Jeremy Piven, Xander Berkeley, Begonya Plaza, Rick Avery, Hazelle Goodman, Ray Buktenica, Max Daniels, Vince Deadrick Jr., Steven Ford, Farrah Forke, Patricia Healy, Paul Herman, Cindy Katz, Brian Libby, Dan Martin, Mario Roberts, Thomas Rosales, Jr., Yvonne Zima, Mick Gould, Bud Cort, Viviane Vives, Kim Staunton, Martin Ferrero, Brad Baldridge, Andrew Camuccio, Kenny Endoso, Kimberly Flynn, Niki Harris, Bill McIntosh, Rick Marzan, Terry Miller, Daniel O'Haco, Kai Soremekun, Peter Blackwell, Trevor Coppola, Mary Kircher, Darin Mangan, Robert Miranda, Manny Perry, Iva Franks Singer, Tim Werner, Philip Ettington
658000000.0Comedy, RomanceenAn ugly duckling having undergone a remarkable change, still harbors feelings for her crush: a carefree playboy, but not before his business-focused brother has something to say about it.6.677277Paramount Pictures, Scott Rudin Productions, Mirage Enterprises, Sandollar Productions, Constellation Entertainment, Worldwide, Mont Blanc Entertainment GmbHDE, US1995-12-150.0127.0You are cordially invited to the most surprising merger of the year.Sabrina6.2141.01995.012.00.000000['Sydney Pollack']11860Linus Larrabee, Sabrina Fairchild, David Larrabee, Mrs. Ingrid Tyson, Maude Larrabee, Fairchild, Patrick Tyson, Elizabeth Tyson, Mack, Irene, Louis, Scott, Rosa, Joanna, Martine, Linda, Ron, Nurse, Carol, Ticket Taker, Singer at Larrabee Party, Butler, Red Head, Bartender, Kelly, India, Make-Up Assistant, Assistant, Model, Model, Model, Model, Model, Model, Paris Friend, Paris Friend, Paris Friend, Paris Friend, Paris Friend, Paris Friend, Helicopter Pilot, Gulf Stream Pilot, Sheik, Tyson Butler, Mother in Hospital, Father in Hospital, Trainer, Secretary, Moroccan Waiter, Senator, Japanese Businessman (uncredited), Airport Employee (uncredited), Head Butler (uncredited), Businessman in Window (uncredited), Wedding Guest (uncredited), Pizza Patron (uncredited), Ballroom Dancer (uncredited)Harrison Ford, Julia Ormond, Greg Kinnear, Angie Dickinson, Nancy Marchand, John Wood, Richard Crenna, Lauren Holly, Dana Ivey, Fanny Ardant, Patrick Bruel, Paul Giamatti, Miriam Colón, Elizabeth Franz, Valérie Lemercier, Becky Ann Baker, John C. Vennema, Margo Martindale, J. Smith-Cameron, Christine Luneau-Lipton, Michael Dees, Denis Holmes, Jo-Jo Lowe, Ira Wheeler, Philippa Cooper, Ayako Kawahara, François Genty, Guillaume Gallienne, Inés Sastre, Phina Oruche, Andrea Behalikova, Jennifer Herrera, Kristina Kumlin, Eva Linderholm, Carmen Chaplin, Micheline Van de Velde, Joanna Rhodes, Alan Boone, Patrick Forster-Delmas, Kentaro Matsuo, Peter McKernan, Ed Connelly, Ronald L. Schwary, Alvin Lum, Siching Song, Phil Nee, Randy Becker, Susan Browning, Anthony Mondal, Peter Parks, Woodrow Asai, Eric Bruno Borgman, Michael Cline, Christopher Del Gaudio, Philippe Hartmann, Jerry Quinn, Dori Rosenthal
70.0Action, Adventure, Drama, FamilyenA mischievous young boy, Tom Sawyer, witnesses a murder by the deadly Injun Joe. Tom becomes friends with Huckleberry Finn, a boy with no future and no family. Tom has to choose between honoring a friendship or honoring an oath because the town alcoholic is accused of the murder. Tom and Huck go through several adventures trying to retrieve evidence.2.561161Walt Disney PicturesUS1995-12-220.097.0The Original Bad Boys.Tom and Huck5.445.01995.012.00.000000['Peter Hewitt']45325Tom Sawyer, Huck Finn, Becky Thatcher, Muff Potter, Aunt Polly, Injun Joe, TownspersonJonathan Taylor Thomas, Brad Renfro, Rachael Leigh Cook, Michael McShane, Amy Wright, Eric Schweig, Tamara Mello
835000000.0Action, Adventure, ThrillerenInternational action superstar Jean Claude Van Damme teams with Powers Boothe in a Tension-packed, suspense thriller, set against the back-drop of a Stanley Cup game.Van Damme portrays a father whose daughter is suddenly taken during a championship hockey game. With the captors demanding a billion dollars by game's end, Van Damme frantically sets a plan in motion to rescue his daughter and abort an impending explosion before the final buzzer...5.231580Universal Pictures, Imperial Entertainment, Signature EntertainmentUS1995-12-2264350171.0106.0Terror goes into overtime.Sudden Death5.5174.01995.012.01.838576['Peter Hyams']9091Darren Francis Thomas McCord, Joshua Foss, Matthew Hallmark, Vizepräsident Daniel Bender, Tyler, Emily McCordJean-Claude Van Damme, Powers Boothe, Dorian Harewood, Raymond J. Barry, Ross Malinger, Whittni Wright
958000000.0Adventure, Action, ThrillerenJames Bond must unmask the mysterious head of the Janus Syndicate and prevent the leader from utilizing the GoldenEye weapons system to inflict devastating revenge on Britain.14.686036United Artists, Eon ProductionsGB, US1995-11-16352194034.0130.0No limits. No fears. No substitutes.GoldenEye6.61194.01995.011.06.072311['Martin Campbell']710James Bond, Alec Trevelyan, Natalya Fyodorovna Simonova, Xenia Onatopp, Jack Wade, M, General Arkady Grigorovich Ourumov, Valentin Dmitrovich Zukovsky, Boris Grishenko, Defense Minister Dmitri Mishkin, Q, Miss Moneypenny, Bill Tanner, Caroline, Severnaya Duty Officer, Admiral Chuck Farrell, Computer Store Manager, Irina, Anna, Mig PilotPierce Brosnan, Sean Bean, Izabella Scorupco, Famke Janssen, Joe Don Baker, Judi Dench, Gottfried John, Robbie Coltrane, Alan Cumming, Tchéky Karyo, Desmond Llewelyn, Samantha Bond, Michael Kitchen, Serena Gordon, Simon Kunz, Billy J. Mitchell, Constantine Gregory, Minnie Driver, Michelle Arthur, Ravil Isyanov
BudgetGenresOriginalLanguageOverviewPopularityProductionCompaniesProductionCountriesReleaseDateRevenueRuntimeTaglineTitleVoteAverageVoteCountReleaseYearReleaseMonthReturnDirectorIdMovieCharacterActorName
45466NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Jean Yarbrough']84419The Creeper, Steven Morrow, Joan Medford, Police Lt. Larry Brooks, Marcel De Lange, F. Holmes Harmon, Hal Ormiston, Lady of the Streets, Stella McNally, Mr. Samuels, JerryRondo Hatton, Robert Lowery, Virginia Grey, Bill Goodwin, Martin Kosleck, Alan Napier, Howard Freeman, Virginia Christine, Joan Shawlee, Byron Foulger, Syd Saylor
45467NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Ben Rock']390959Debuty Hank Hart, Jeff Patterson, Kathy Patterson, Bill Barnes, Dr. Liam Woblick, Aidan James, Jeff Schoene, Dilva Henry, Vera Tenslue, News Reporter #2, John Huck, Kim Diamond, Miriam Lane, News Reporter #3, Frank Parsons, Dr. Clayton Larson, David Paulson, Bill Dixon, Donald McFerrellTony Abatemarco, Andre Brooks, Mariclare Costello, Bill Dreggors, Apollo Dukakis, Philip Friedman, James Gleason, Dilva Henry, Bari Hochwald, Wendy Hoffman, John Huck, Rachel Moskowitz, Sandy Mulvihill, Roger Nolan, Chris Parnell, Byrne Piven, Richard Sexton, Rich Williams, Ray Xifo
45468NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Ben Rock']289923Branwall, Sarah Didonna, Kyle Brody, Bill Barnes, Rustin Parr, Heather Donahue, Joshua Leonard, Michael C. WilliamsMonty Bane, Lucy Butler, David Grammer, Bill Dreggors, Frank Pastor, Heather Donahue, Joshua Leonard, Michael C. Williams
45469NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Aaron Osborne']222848Kira (as Cassandra Leigh), Daly, Ruggs, Lewis, Billie, Dillon, Reitman, Ice, Announcer, KillaLisa Boyle, Kena Land, Zaneta Polard, Don Yanan, Debra K. Beatty, Mark Sikes, Robert J. Ferrelli, Ellyn Dawn Humphreys, Ron Jeremy, Ben Ramsey
45470NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['John Irvin']30840Sir Robert Hode, Maid Marian, Little John, Sir Miles Folcanet, Baron Roger DaguerrePatrick Bergin, Uma Thurman, David Morrissey, Jürgen Prochnow, Jeroen Krabbé
45471NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Hamid Nematollah']439050, ,Leila Hatami, Kourosh Tahami, Elham Korda
45472NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Lav Diaz']111109Sister Angela, Homer, Crazy Woman/Virgin, Amang Tiburcio, Ex-convict/Dindo, Philosopher, Photographer, Ana/Call Center Woman, Filmmaker/Butcher, Poet of the Rain, Homer's motherAngel Aquino, Perry Dizon, Hazel Orencio, Joel Torre, Bart Guingona, Soliman Cruz , Roeder, Angeli Bayani, Dante Perez, Betty Uy-Regala, Modesta
45473NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Mark L. Lester']67758Emily Shaw, Det. Mark Winston, Jayne Ferré, Alex Tyler, Tony, Frank Bianci, Detective Stan, Kerry Shaw, Peter Quinn, Boyd, Sammy Benetto, Steve, Fred, Artie, Hitman #1, DoormanErika Eleniak, Adam Baldwin, Julie du Page, James Remar, Damian Chapa, Louis Mandylor, Tom Wright, Jeremy Lelliott, James Quattrochi, Jason Widener, Joe Sabatino, Kiko Ellsworth, Don Swayze, Peter Dobson, Darrell Dubovsky
45474NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Yakov Protazanov']227506, , , ,Iwan Mosschuchin, Nathalie Lissenko, Pavel Pavlov, Aleksandr Chabrov, Vera Orlova
45475NaNNoGenreNoLanguageNoOverviewNaNMissingValueNoProductionCountriesNoReleaseDateNaNNaNNaNNoTitleNaNNaNNaNNaNNaN['Daisy Asquith']461257NoCharacterNoName